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This invention relates to nucleic acid and amino acid sequences of cell signaling proteins 
and to the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative 
and inflammatory disorders. 



Signal transduction is the process of biochemical events by which cells respond to 
extracellular signals. Extracellular signals are transduced through a biochemical cascade that 
begins with the binding of a signal molecule such as a hormone, neurotransmitter, or growth 
factor, to a cell membrane receptor and ends with the activation of an intracellular target molecule. 
The process of signal transduction regulates a wide variety of cell functions including cell 
proliferation, differentiation, and gene transcription. 

Signal transduction is the general process by which cells respond to extracellular signals 
(hormones, neurotransmitters, growth and differentiation factors, etc.) through a cascade of 
biochemical reactions that begins with the binding of the signaling molecule to a cell membrane 
receptor and ends with the activation of an intracellular target molecule. Intermediate steps in this 
process involve the activation of various cytoplasmic proteins by phosphorylation via protein 
kinases and the eventual translocation of some of these activated proteins to the cell nucleus where* 
the transcription of specific genes is triggered. Thus, the signal transduction process regulates all 
types of cell functions including cell proliferation, differentiation, and gene transcription. 

Protein kinases play a key role in the signal transduction process by phosphorylating and 
activating various proteins involved in signaling pathways. The high energy phosphate which 
drives this activation is generally transferred from adenosine triphosphate molecules (ATP) to a 
particular protein by protein kinases and removed from that protein by protein phosphatases. 
Phosphorylation occurs in response to extracellular signals, cell cycle checkpoints, and 
environmental or nutritional stresses. Protein kinases are roughly divided into two groups: those 
that phosphorylate tyrosine residues (protein tyrosine kinases, PTK) and those that phosphorylate 
serine or threonine residues (serine/threonine kinases, STK). A few protein kinases have dual 
specificity for serine/threonine and tyrosine residues. Almost all kinases contain a similar 250-300 
amino acid catalytic domain containing specific residues and sequence motifs characteristic of the 
kinase family. (Hardie, G. and Hanks, S. (1995) The Protein Kinase Facts Books, Vol 1:7-20 
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Academic Press, San Diego, CA.) 

STKs include the second messenger dependent protein kinases such as tiie cyclic-AMP 
dependent protein kinases (PKA), which are involved in mediating hormone- induced cellular 
responses; calcium-calmodulin (CaM) dependent protein kinases, which are involved in regulation 
of smooth muscle contraction, glycogen breakdown, and neurotransmission; and the mitogen- 
activated protein kinases (MAP) which mediate signal transduction from, the cell surface to the 
nucleus via phosphorylation cascades. Altered PKA expression is implicated in a variety of 
disorders and diseases including cancer, thyroid disorders, diabetes, atherosclerosis, and 
cardiovascular disease. (Isselbacher, K.J. et al. (1994) Harrison's Principles of Internal Medicine . 
McGraw-Hill, New York, NY, pp. 416-431, 1887.) 

PTKs are divided into transmembrane, receptor PTKs and nontransmembrane, non- 
receptor PTKs. Transmembrane protein-tyrosine kinases are receptors for most growth factors 
which include epidermal GF, platelet-derived GF, fibroblast GF, hepatocyte GF, insulin and 
insulin-like GFs, nerve GF, vascular endothelial GF, and macrophage colony stimulating factor. 
Non-receptor PTKs lack transmembrane regions and, instead, form complexes with the 
intracellular regions of ceil surface receptors. Receptors that function through non-receptor PTKs 
include those for cytokines, hormones (growth hormone and prolactin) and antigen-specific 
receptors on T and B lymphocytes. 

Many of these PTKs were first identified as the products of mutant oncogenes in cancer 
cells where their activation was no longer subject to normal cellular controls. In fact, about one 
third of the known oncogenes encode PTKs, and it is well known that cellular transformation 
(oncogenesis) is often accompanied by increased tyrosine phosphorylation activity. (Charbonneau 
H and Tonks NK (1992) Annu Rev Cell Brol 8:463-493.) 

Protein phosphatases regulate the effects of protein kinases by removing phosphate groups 
from molecules previously activated by kinases. The two principle categories of protein 
phosphatases are the protein phosphatases (PPs) and the protein tyrosine phosphatases (PTPs). 
PPs dephosphorylate phosphoserine/threonine residues and are important regulators of many 
cAMP-mediated hormone responses in cells. (Cohen, P. (1989) Annu. Rev. Biochem. 58:453- 
508.) PTPs reverse the effects of protein tyrosine kinases and play a significant role in cell cycle 
and cell signaling processes. (Charbonneau and Tonks, supra. ) In the process of cell division, for 
example, a specific PTP (M-phase inducer phosphatase) plays a key role in the induction of 
mitosis by dephosphorylating and activating a specific PTK (CDC2) leading to cell division. 
(Sadu, K.. et al. (1990) Proc. Natl. Acad. Sci. 87:5139-5143.) 

Guanine nucleotide binding proteins (GTP-binding proteins) are critical mediators of the 
signal transduction pathway. Extracellular ligands such as hormones, growth factors. 
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neuromodulators, or other signaling molecules bind to transmembrane receptors, and the signal is 
propagated to effector molecules by intracellular signal transducing proteins. Many of these signal 
transduction proteins are GTP-binding proteins which regulate intracellular signaling pathways. 
GTP-binding proteins participate in a wide range of other regulatory functions including 
metabolism, growth, differentiation, cytoskeletal organization, and intracellular vesicle transport 
and secretion. Exchange of bound GDP for GTP followed by hydrolysis of GTP to GDP provides 
the energy that enables GTP-binding proteins to alter their conformation and interact with other 
cellular components. Two structurally distinct classes of GTP-binding proteins are recognized: 
heterotrimeric GTP-binding proteins, consisting of three different subunits, and monomeric, low 
molecular weight (LM W), GTP-binding proteins consisting of a single polypeptide chain. 

G protein coupled receptors (GPCR) are a superfamily of integral membrane proteins 
which transduce extracellular signals. GPCRs include receptors for biogenic amines, mediators of 
inflammation, peptide hormones, and sensory signal mediators. A GPCR becomes activated when 
the receptor binds to its extracellular ligand. The beta subunit of the GPCR, which consists of an 
amino-terminal helical segment followed by seven WD, or p transducin repeats, transduces signals 
across the plasma membrane. Conformational changes in the GPCR, resulting from the ligand- 
receptor interaction, promote the binding of GTP to the GPCR intracellular domains. GTP binding 
to the GPCR leads to the interaction of the GPCR alpha subunit with adenylate cyclase or other 
second messenger molecule generators. This interaction regulates the activity of second - ' 
messenger molecules such as cAMP, cGMP, or eicosinoids which, in turn, regulate 
phosphorylation and activation of other intracellular proteins. The GPCR changes conformation 
upon hydrolysis of the bound GTP by GTPases, dissociates from the second messenger molecule 
generator, and returns to its initial pre-ligand binding conformation. 

G beta proteins, also known as p transducins, contain seven tandem repeats of the WD- 
repeat sequence motif, a motif found in many proteins with regulatory functions. WD-repeat 
proteins contain from four to eight copies of a loosely conserved repeat of approximately 40 
amino acids which participates in protein-protein interactions. Mutations and variant expression 
of p transducin proteins are linked with various disorders. Mutations in LISl, a subunit of the 
human platelet activating factor acetylhydrolase, cause Miller-Dieker lissencephaly. RACKl 
binds activated protein kinase C, and RbAp48 binds retinoblastoma protein. CstF is required for 
polyadenylation of mammalian pre-mRNA in vitro and associates with subunits of cleavage- 
stimulating factor. CD4, an integral membrane glycoprotein which functions as an HIV co- 
receptor for infection of human host cells is degraded by HIV-encoded Vpu in the endoplasmic 
reticulum. WD repeats of human beta TrCP molecule mediate the formation of the CD4- Vpu, 
inducing CD4 proteolysis (Neer, E.J. et al. (1994) Nature 371 :297-300 and Margonin, F. et al. 
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Irregularities in the GPCR signaling cascade may result in abnormal activation of 
leukocytes and lymphocytes, leading to the tissue damage and destruction seen in many 
inflammatory and autoimmune diseases such as rheumatoid arthritis, biliary cirrhosis, hemolytic 
5 anemia, lupus erythematosus, and thyroiditis. Abnormal cell proliferation, including cyclic AMP 
stimulation of brain, thyroid, adrenal, and gonadal tissue proliferation is regulated by G proteins. 
Mutations in subunits have been found in growth-hormone-secreting pituitary somatotroph 
tumors, hyperfunctioning thyroid adenomas, and ovarian and adrenal neoplasms (Meij, J.T.A. 
(1996) Mol. Cell. Biochem. 157:31-38; AusseL C, et al. (1988) J. Immunol. 140:215-220). 

10 LMW GTP-binding proteins regulate cell growth, cell cycle control, protein secretion, and 

intracellular vesicle interaction. They consist of single polypeptides which, like the alpha subunit 
of the heterotrimeric GTP-binding proteins, are able to bind to and hydrolyze GTP, thus cycling 
between an inactive and an active state, LMW GTP-binding proteins respond to extracellular 
signals from receptors and activating proteins by transducing mitogenic signals involved in 

15 various cell functions. The binding and hydrolysis of GTP regulates the response of LMW GTP- 
binding proteins and acts as an energy source during this process (Bokoch, G. M. and Der, C. J. 
(1993) FASEB J. 7:750-759). 

At least sixty members of the LMW GTP-binding protein superfamily have been 
identified and are currently grouped into the four subfamilies of ras, rho, arf, sari, ran, and rab. 

20 Activated ras genes were initially found in human cancers and subsequent studies confirmed that 
ras function is critical in determining whether ceils continue to grow or become differentiated. 
Other members of the LMW G-protein superfamily have roles in signal transduction that vary with 
the function of the activated genes and the locations of the GTP-binding proteins that initiate the 
activity. Rho GTP-binding proteins control signal transduction pathways that link growth factor 

25 receptors to actin polymerization, which is necessary for normal cellular growth and division. The 
rab, arf, and sari families of proteins control the translocation of vesicles to and from membranes 
for protein localization, protein processing, and secretion. Ran GTP-binding proteins are located 
in the nucleus of cells and have a key role in nuclear protein import, the control of DNA synthesis, 
and cell-cycle progression (Hall, A. (1990) Science 249:635-640; Barbacid, M. (1987) Ann. Rev 

30 Biochem. 56:779-827; and Sasaki, T. and Takai, Y. (1998) Biochem. Biophys, Res. Commun. 
245:641-645). 

LMW GTP-binding proteins are GTPases which cycle between a GTP-bound active form 
and a GDP-bound inactive form. This cycle is regulated by proteins that affect GDP dissociation, 
GTP association, or the rate of GTP hydrolysis. Proteins affecting GDP association are 
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represented by guanine nucleotide dissociation inhibitors and guanine nucleotide exchange factors 
(GEP). The best characterized is the mammaiian homologue of the Drosophila Son-of-Sevenless 
protein. Proteins affecting GTP hydrolysis are exemplified by GTPase-activating proteins (GAP). 
Both GEP and GAP activity may be controlled in response to extracellular stimuli and modulated 
by accessory proteins such as RalBPl and POBl. The GDP-bound form is converted to the 
GTP-bound form through a GDP/GTP exchange reaction facilitated by guanine 
nucleotide-reieasing factors. The GTP-bound form is converted to the GDP-bound form by 
intrinsic GTPase activity, and the conversion is accelerated by GAP (Ikeda, M. et al. (1998) J. 
Biol. Chem. 273:814-821;Quilliam, L. A. (1995) Bioessays 17:395-404.). Mutant Ras-family 
proteins, which bind but can not hydrolyze GTP, are permanently activated, and cause cell 
proliferation or cancer, as do GEP that activate LMW GTP-binding proteins (Drivas, G. T. et al. 
(1990) MoL Cell. Biol. 10:1793-1798; and Whitehead, I. P. et al. (1998) Mol Cell Biol. 
18:4689-4697.) 

The discovery of new cell signaling proteins and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of cell proliferative and inflammatory disorders. -^^ 



The invention features substantially purified polypeptides, cell signaling proteins, referred 
to collectively as "CSIGP" and individually as CSIGP-l through CSIGP-13 . In one aspect, the- 
invention provides a substantially purified polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-13, and fragments thereof 

The invention further provides a substantially purified variant having at least 90% amino 
acid identity to at least one of the amino acid sequences selected from the group consisting of SEQ 
ID NO: 1-13, and fragments thereof The invention also provides an isolated and purified 
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-13, and fragments thereof. The invention also includes an 
isolated and purified polynucleotide variant having at least 70% polynucleotide sequence identity 
to the polynucleotide encoding the polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: I - 13, and fragments thereof. 

Additionally, the invention provides an isolated and purified polynucleotide which 
hybridizes under stringent conditions to the polynucleotide encoding the polypeptide comprising 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-13, and fragments 
thereof The invention also provides an isolated and purified polynucleotide having a sequence 
which is complementary to the polynucleotide encoding the polypeptide comprising the amino 
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acid sequence selected from the group consisting of SEQ ID NO: 1-13, and fragments thereof 

The invention also provides an isolated and purified polynucleotide comprising a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 14-26, and fragments 
thereof The invention further provides an isolated and purified polynucleotide variant having at 
least 70% polynucleotide sequence identity to the polynucleotide sequence selected from the 
group consisting of SEQ ID NO: 1 4-26 and fragments thereof. The invention also provides an 
isolated and purified polynucleotide having a sequence which is complementary to the 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO: 14-26 and fragments thereof. 

The invention also provides a method for detecting a polynucleotide in a sample 
containing nucleic acids, the method comprising the steps of (a) hybridizing the complement of the 
polynucleotide sequence to at least one of the polynucleotides of the sample, thereby forming a 
hybridization complex; and (b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of a polynucleotide in the sample. In one 
aspect, the method further comprises amplifying the polynucleotide prior to hybridization. 

The invention further provides an expression vector containing at least a fragment of the 
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-13, and fragments thereof In another aspect, the expression 
vector is contained within a host cell. 

The invention also provides a method for producing a polypeptide, the method comprising 
the steps of: (a) culturing the host cell containing an expression vector containing at least a 
fragment of a polynucleotide under conditions suitable for the expression of the polypeptide; and 
(b) recovering the polypeptide from the host cell culture. 

The invention also provides a pharmaceutical composition comprising a substantially 
purified polypeptide having the amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-13, and fragments thereof, in conjunction with a suitable pharmaceutical carrier. 

The invention further includes a purified antibody which binds to a polypeptide selected 
from the group consisting of SEQ ID NO: 1 - 1 3, and fragments thereof The invention also 
provides a purified agonist and a purified antagonist to the polypeptide. 

The invention also provides a method for treating or preventing a disorder associated with 
decreased expression or activity of CSIGP, the method comprising administering to a subject in 
need of such treatment an effective amount of a pharmaceutical composition comprising a 
substantially purified polypeptide having the amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-13, and fragments thereof, in conjunction with a suitable 
pharmaceutical carrier. 



BNSOOCID: <WO 995e558A2_l_> 



wo 99/58558 



PCT/US99/10S67 



The invention also provides a method for treating or preventing a disorder associated with 
increased expression or activity of CSIGP, the method comprising administering to a subject in 
need of such treatment an effective amount of an antagonist of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-13, and fragments thereof, in 
5 conjunction with a suitable pharmaceutical carrier. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows nucleotide and polypeptide sequence identification numbers (SEQ ID NO), 
clone identification numbers (clone ID), cDNA libraries, and cDNA fragments used to assemble 
10 full-length sequences encoding CSIGP. 

Table 2 shows features of each polypeptide sequence including potential motifs, 
homologous sequences, and methods and algorithms used for identification of CSIGP. 

Table 3 shows the tissue-specific expression patterns of each nucleic acid sequence as 
determined by northern analysis, d iseases, disorders or conditions associated with these tissues, 
15 and the vector into which each cDNA was cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which Incyte ^^i. 
cDNA clones encoding CSIGP were isolated. 

Table 5 shows the programs, their descriptions, references, and threshold parameters used 
to analyze CSIGP. 

20 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is 
understood that this invention is not limited to the particular machines, materials and methods 
described, as these may vary. It is also to be understood that the terminology used herein is for the 
25 purpose of describing particular embodiments only, and is not intended to limit the scope of the 
present invention which will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," 
"an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 
30 antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled 
in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any machines, materials, and methods similar or equivalent to those described 
35 herein can be used to practice or test the present invention, the preferred machines, materials and 
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methods are now described. All publications mentioned herein are cited for the purpose of 
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the 
publications and which might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosure by vinue of 
5 prior invention. 
DEFINITIONS 

"CSIGP" refers to the amino acid sequences of substantially purified CSIGP obtained 
from any species, particularly a mammalian species, including bovine, ovine, porcine, murine, 
equine, and preferably the human species, from any source, whether natural, synthetic, 
to semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which, when bound to CSIGP, increases or 
prolongs the duration of the effect of CSIGP. Agonists may include proteins, nucleic acids, 
carbohydrates, or any other molecules which bind to and modulate the effect of CSIGP. 

An "allelic variant" is an alternative form of the gene encoding CSIGP. Allelic variants 

15 may result from at least one mutation in the nucleic acid sequence and may result in altered 
mRNAs or in polypeptides whose structure or function may or may not be altered. Any given 
natural or recombinant gene may have none, one, or many allelic forms. Common mutational 
changes which give rise to allelic variants are generally ascribed to natural deletions, additions, or 
substitutions of nucleotides. Each of these types of changes may occur alone, or in combination 

20 with the others, one or more times in a given sequence. 

"Altered" nucleic acid sequences encoding CSIGP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polynucleotide the same as 
CSIGP or a polypeptide with at least one functional characteristic of CSIGP. Included within this 
definition are polymorphisms which may or may not be readily detectable using a particular 

25 oligonucleotide probe of the polynucleotide encoding CSIGP, and improper or unexpected 
hybridization to allelic variants, with a locus other than the normal chromosomal locus for the 
polynucleotide sequence encoding CSIGP. The encoded protein may also be "altered," and may 
contain deletions, insertions, or substitutions of amino acid residues which produce a silent change 
and result in a functionally equivalent CSIGP. Deliberate amino acid substitutions may be made 

30 on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 
amphipathic nature of the residues, as long as the biological or immunological activity of CSIGP is 
retained. For example, negatively charged amino acids may include aspartic acid and glutamic 
acid, positively charged amino acids may include lysine and arginine, and amino acids with 
uncharged polar head groups having similar hydrophilicity values may include leucine, isoleucine, 

35 and valine; glycine and alanine; asparagine and glutamine; serine and threonine; and 
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phenylalanine and tyrosine. 

The terms "amino acid" or "amino acid sequence'' refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or ' 
synthetic molecules. In this context, "fragments," "immunogenic fragments," or "antigenic 
5 fragments" refer to fragments of CSIGP which are preferably at least 5 to about 1 5 amino acids in 
length, most preferably at least 14 amino acids, and which retain some biological activity or 
immunological activity of CSIGP. Where "amino acid sequence" is recited to refer to an amino 
acid sequence of a naturally occurring protein molecule, "amino acid sequence" and like terms are 
not meant to limit the amino acid sequence to the complete native amino acid sequence associated 
10 with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PGR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which, when bound to CSIGP, decreases the 

15 amount or the duration of the effect of the biological or immunological activity of CSIGP. 

Antagonists may include proteins, nucleic acids, carbohydrates, antibodies, or any other molecules 
which decrease the effect of CSIGP. > 
The term "antibody" refers to intact molecules as well as to fragments thereof, such as- 
Fab, F(ab')2, and Fv fragments, which are capable of binding the epitopic determinant. Antibodies 

10 that bind CSIGP polypeptides can be prepared using intact polypeptides or using fragments 

containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide 
used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of 
RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly 
used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, 

15 and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 
The term "antigenic determinant" refers to that fragment of a molecule (i.e., an epitope) 
that makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (given regions or three-dimensional structures on 

0 the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition containing a nucleic acid sequence which 
is complementary to the "sense" strand of a specific nucleic acid sequence. Antisense molecules 
may be produced by any method including synthesis or transcription. Once introduced into a cell, 

5 the complementary nucleotides combine with natural sequences produced by the cell to form 
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duplexes and to block either transcription or translation. The designation "negative'' can refer to 
the antisense strand, and the designation "positive" can refer to the sense strand. 

The term "biologically active," refers to a protein having structural, regulatory, or 
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" 
5 refers to the capability of the natural, recombinant, or synthetic CSIGP, or of any oligopeptide 
thereof, to induce a specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence "5' A-G-T 3*" bonds to the 
10 complementary sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules 
may be "partial," such that only some of the nucleic acids bind, or it may be "complete," such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency and strength 
of the hybridization between the nucleic acid strands. This is of particular importance in 
15 amplification reactions, which depend upon binding between nucleic acids strands, and in the 
design and use of peptide nucleic acid (PNA) molecules. 

A "composition comprising a given polynucleotide sequence" or a "composition 
comprising a given amino acid sequence" refer broadly to any composition containing the given 
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an 
20 aqueous solution. Compositions comprising polynucleotide sequences encoding CSIGP or 
fragments of CSIGP may be employed as hybridization probes. The probes may be stored in 
freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In 
hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCI), 
detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution, 
25 dry milk, salmon sperm DNA, etc.), 

"Consensus sequence"refers to a nucleic acid sequence which has been resequenced to 
resolve uncalled bases, extended using XL-PCR kit (Perkin-EImer, Norwalk CT) in the 5* and/or 
the 3' direction, and resequenced, or which has been assembled from the overlapping sequences of 
more than one Incyte Clone using a computer program for fragment assembly, such as the 
30 GELVIEW Fragment Assembly system (GCG, Madison WI). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

The term "correlates with expression of a polynucleotide" indicates that the detection of 
the presence of nucleic acids, the same or related to a nucleic acid sequence encoding CSIGP, by 
northern analysis is indicative of the presence of nucleic acids encoding CSIGP in a sample, and 
35 thereby correlates with expression of the transcript from the polynucleotide encoding CSIGP. 
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A/'deIetion"refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a 
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
5 example, replacement of hydrogen by an alkyi, acyl, or amino group. A derivative polynucleotide 
encodes a polypeptide which retains at least one biological or immunological function of the 
natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any 
similar process that retains at least one biological or immunological function of the polypeptide 
from which it was derived. 
0 The term "similarity" refers to a degree of complementarity. There may be partial 

similarity or complete similarity. The word ''identity" may substitute for the word "similarity." A 
partially complementary sequence that at least partially inhibits an identical sequence from 
hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition of 
hybridization of the completely complementary sequence to the target sequence may be examined 
5 using a hybridization assay (Southern or northern blot, solution hybridization, and the like) under 
conditions of reduced stringency. A substantially similar sequence or hybridization probe will - 
compete for and inhibit the binding of a completely similar (identical) sequence to the target 
sequence under conditions of reduced stringency. This is not to say that conditions of reduced 
stringency are such that non-specific binding is permitted, as reduced stringency conditions 
require that the binding of two sequences to one another be a specific (i.e., a selective) interaction. 
The absence of non-specific binding may be tested by the use of a second target sequence which 
lacks even a partial degree of complementarity (e.g., less than about 30% similarity or identity); 
In the absence of non-specific binding, the substantially similar sequence or probe will not 
hybridize to the second non-complementary target sequence. 

The phrases "percent identity" or "% identity" refer to the percentage of sequence 
similarity found in a comparison of two or more amino acid or nucleic acid sequences. Percent 
identity can be determined electronically, e.g., by using the MEGALIGN program (DNASTAR, 
Madison WI). The MEGALIGN program can create alignments between two or more sequences 
according to different methods, e.g., the clustal method. (See, e.g., Higgins, D.G. and P.M. Sharp 
(1988) Gene 73:237-244.) The clustal algorithm groups sequences into clusters by examining the 
distances between all pairs. The clusters are aligned pairwise and then in groups. The percentage 
similarity between two amino acid sequences, e.g., sequence A and sequence B, is calculated by 
dividing the length of sequence A, minus the number of gap residues in sequence A, minus the 
number of gap residues in sequence B, into the sum of the residue matches between sequence A 
and sequence B, times one hundred. Gaps of low or of no similarity between the two amino acid 
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sequences are not included in determining percentage similarity. Percent identity between nucleic 
acid sequences can also be counted or calculated by other methods known in the art, e.g., the Jotun 
Hein method. (See, e.g., Hein, J. (1990) Methods Enzymoi. 183:626-645.) Identity between 
sequences can also be determined by other methods known in the art, e.g., by varying 
hybridization conditions. 

"Human artificial chromosomes'' (HACs) are linear microchromosomes which may 
contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements 
required for stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to any process by which a strand of nucleic acid binds with a 
complementary strand through base pairing. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., Qt or R^t analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a 
solid support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 
substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" or "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively, 
to the sequence found in the naturally occurring molecule, 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by 
expression of various factors, e.g., cytokines, chemokines, and other signaling molecules, which 
may affect cellular and systemic defense systems. 

The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate. 
The terms "element" or "array element" in a microarray context, refer to hybridizabie 
polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of CSIGP. For example, 
modulation may cause an increase or a decrease in protein activity, binding characteristics, or any 
other biological, functional, or immunological properties of CSIGP. 

The phrases "nucleic acid" or "nucleic acid sequence" refer to a nucleotide, 
oligonucleotide, polynucleotide, or any fragment thereof These phrases also refer to DNA or 
RNA of genomic or synthetic origin which may be single-stranded or double-stranded and may 
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represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or 
RNA-Iike material. In this context, "fragments" refers to those nucleic acid sequences which, 
when translated, would produce polypeptides retaining some functional characteristic, e.g., 
antigenicity, or structural domain characteristic, e.g., ATP-binding site, of the full-length 
5 polypeptide. 

The terms "operably associated" or "operably linked" refer to functionally related nucleic 
acid sequences. A promoter is operably associated or operably linked with a coding sequence if 
the promoter controls the translation of the encoded polypeptide. While operably associated or 
operably linked nucleic acid sequences can be contiguous and in the same reading frame, certain 
10 genetic elements, e.g., repressor genes, are not contiguously linked to the sequence encoding the 
polypeptide but still bind to operator sequences that control expression of the polypeptide. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least about 6 
nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and most preferably about 20 
to 25 nucleotides, which can be used in PCR amplification or in a hybridization assay or 
15 microarray. "Oligonucleotide" is substantially equivalent to the terms "amplimer," "primer," - 
"oligomer," and "probe," as these terms are commonly defined in the art. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which^^ 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
20 PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript ^ ~ 
elongation, and may be pegylated to extend their lifespan in the cell. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding CSIGP, or fragments thereof, or CSIGP itself, may comprise a bodily fluid; an 
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic 
25 DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" or "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, or an antagonist. The interaction is dependent upon 
the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, 
recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the 
30 presence of a polypeptide containing the epitope A, or the presence of free unlabeled A, in a 

reaction containing free labeled A and the antibody will reduce the amount of labeled A that binds 
to the antibody. 

The term "stringent conditions" refers to conditions which permit hybridization between 
polynucleotides and the claimed polynucleotides. Stringent conditions can be defined by salt 
35 concentration, the concentration of organic solvent, e.g., formamide, temperature, and other 
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conditions well known in the art. In particular, stringency can be increased by reducing the 
concentration of salt, increasing the concentration of formamide, or raising the hybridization 
temperature. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
5 removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably about 75% free, and most preferably about 90% free from other components with 
which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acids or nucleotides by 
different amino acids or nucleotides, respectively, 
10 "Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

"Transformation" describes a process by which exogenous DNA enters and changes a 
1 5 recipient cell. Transformation may occur under natural or artificial conditions according to 
various methods well known in the art, and may rely on any known method for the insertion of 
foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for 
transformation is selected based on the type of host cell being transformed and may include, but is 
not limited to, viral infection, electroporation, heat shock, lipofection, and particle bombardment. 
20 The term "transformed" cells includes stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for 
limited periods of time. 

A "variant" of CSIGP polypeptides refers to an amino acid sequence that is altered by one 
25 or more amino acid residues. The variant may have "conservative" changes, wherein a substituted 
amino acid has similar structural or chemical properties (e.g., replacement of leucine with 
isoleucine). More rarely, a variant may have "nonconservative" changes (e.g., replacement of 
glycine with tryptophan). Analogous minor variations may also include amino acid deletions or 
insertions, or both. Guidance in determining which amino acid residues may be substituted, 
30 inserted, or deleted without abolishing biological or immunological activity may be found using 
computer programs well known in the art, for example, LASERGENE software (DNASTAR). 

The term "variant," when used in the context of a polynucleotide sequence, may 
encompass a polynucleotide sequence related to CSIGP. This definition may also include, for 
example, "allelic" (as defined above), "splice," "species," or "polymorphic" variants. A splice 
35 variant may have significant identity to a reference molecule, but will generally have a greater or 
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lesser number of polynucleotides due to alternate splicing of exons during mRNA processing. The 
corresponding polypeptide may possess additional functional domains or an absence of domains. 
Species variants are polynucleotide secfuences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A 
5 polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence 
of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for 
a disease state. 

10 THE INVENTION 

The invention is based on the discovery of new human cell signaling proteins (CSIGP), 
the polynucleotides encoding CSIGP, and the use of these compositions for the diagnosis, 
treatment, or prevention of cell proliferative and inflammatory disorders. 

Table 1 lists the Incyte Clones used to derive full length nucleotide sequences encoding 

15 CSIGP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NO) of the amino 
acid and nucleic acid sequences, respectively. Column 3 shows the Clone ID of the Incyte Clorie 
in which nucleic acids encoding each CSIGP were first identified, and column 4, the cDNA 
libraries from which these clones were isolated. Column 5 shows Incyte clones, their 
corresponding cDNA libraries, and shotgun sequences useful as fragments in hybridization ^ 

20 technologies, and which are part of the consensus nucleotide sequence of each CSIGP. - ' 
The columns of Table 2 show various properties of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3, potential phosphorylation sites; column 4, potential glycosylation sites; 
column 5, the amino acid residues comprising signature sequences and motifs; column 6, 

25 homologous sequences; and column 7, analytical methods used to identify each protein through 
sequence homology and protein motifs. 

The columns of Table 3 show the tissue-specificity and disease-association of nucleotide 
sequences encoding CSIGP. The first column of Table 3 lists the polynuclceotide sequence 
identifiers. The second column lists tissue categories which express CSIGP as a fraction of total 

30 tissue categories expressing CSIGP. The third column lists diseases, disorders, and conditiond 
associated with those tissues expressing CSIGP, The fourth column lists the vectors used to 
subclone the cDNA library. 

The following fragments of the nucleotide sequences encoding CSIGP are useful in 
hybridization or amplification technologies to identify SEQ ID NO: 14-26 and to distinguish 

35 between SEQ ID NO: 14-26 and similar polynucleotide sequences. The useful fragments are the 

-15- 



BNSOOCID: <WO 995e558A2J_> 



wo 99/58558 



PCT/US99/10567 



fragment of SEQ ID NO: 14 from about nucleotide 135 to about nucleotide 189, the 
fragment of SEQ ID NO: 15 from about nucleotide 493 to about nucleotide 558, the 
fragment of SEQ ID NO: 16 from about nucleotide 1 170 to about nucleotide 1233, the 
fragment of SEQ ID NO: 17 from about nucleotide 939 to about nucleotide 996, the 
5 fragment of SEQ ID NO: 1 8 from about nucleotide 424 to about nucleotide 486, the 
fragment of SEQ ID NO: 1 9 from about nucleotide 274 to about nucleotide 333, and the 
fragment of SEQ ID NO:20 from about nucleotide 1013 to about nucleotide 1070, the 
fragment of SEQ ID NO:21 from about nucleotide 284 to about nucleotide 325, the fragment of 
SEQ ID NO:22 from about nucleotide 642 to about nucleotide 674, the fragment of SEQ ID 
10 NO:23 from about nucleotide 742 to about nucleotide 769, the fragment of SEQ ID NO:24 from 
about nucleotide 457 to about nucleotide 486, the fragment of SEQ ID NO:25 from about 
nucleotide 205 to about nucleotide 246, and the fragment of SEQ ID NO:26 from about nucleotide 
3 19 to about nucleotide 342. 

The invention also encompasses CSIGP variants. A preferred CSIGP variant is one which 
15 has at least about 80%, more preferably at least about 90%, and most preferably at least about 95% 
amino acid sequence identity to the CSIGP amino acid sequence, and which contains at least one 
functional or structural characteristic of CSIGP. 

The invention also encompasses polynucleotides which encode CSIGP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence 
20 selected from the group consisting of SEQ ID NO: 14-26 which encodes CSIGP. 

The invention also encompasses a variant of a polynucleotide sequence encoding CSIGP. 
In particular, such a variant polynucleotide sequence will have at least about 70%, more preferably 
at least about 85%, and most preferably at least about 95% polynucleotide sequence identity to the 
polynucleotide sequence encoding CSIGP. A particular aspect of the invention encompasses a 
25 variant of a polynucleotide sequence comprising a sequence selected from the group consisting of 
SEQ ID NO: 14-26 which has at least about 70%, more preferably at least about 85%, and most 
preferably at least about 95% polynucleotide sequence identity to a nucleic acid sequence selected 
from the group consisting of SEQ ID NO: 14-26. Any one of the polynucleotide variants described 
above can encode an amino acid sequence which contains at least one functional or structural 
30 characteristic of CSIGP 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding CSIGP, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
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sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring CSIGP, and all such variations are to be 
considered as being specifically disclosed. 
5 Although nucleotide sequences which encode CSIGP and its variants are preferably 

capable of hybridizing to the nucleotide sequence of the naturally occurring CSIGP under 
appropriately selected conditions of stringency, it may be advantageous to produce nucleotide 
sequences encoding CSIGP or its derivatives possessing a substantially different codon usage, 
e.g., inclusion of non-naturally occurring codons. Codons may be selected to increase the rate at 

10 which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance 
with the frequency with which particular codons are utilized by the host. Other reasons for 
substantially altering the nucleotide sequence encoding CSIGP and its derivatives without altering 
the encoded amino acid sequences include the production of RNA transcripts having more 
desirable properties, such as a greater half-life, than transcripts produced from the naturally 

15 occurring sequence. 

The invention also encompasses production of DNA sequences which encode CSIGP and 
CSIGP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 

20 introduce mutations into a sequence encoding CSIGP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO: 14-26 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. 
and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods EnzymoL 

25 1 52:507-5 1 1 .) For example, stringent salt concentration will ordinarily be less than about 750 mM 
NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM trisodium 
citrate, and most preferably less than about 250 mM NaCl and 25 mM trisodium citrate. Low 
stringency hybridization can be obtained in the absence of organic solvent, e.g., formamide, while 
high stringency hybridization can be obtained in the presence of at least about 35% formamide, 

30 and most preferably at least about 50% formamide. Stringent temperature conditions will 

ordinarily include temperatures of at least about 30°C, more preferably of at least about 37°C, and 
most preferably of at least about 42°C. Varying additional parameters, such as hybridization time, 
the concentration of detergent, e.g., sodium dodecyi sulfate (SDS), and the inclusion or exclusion 
of carrier DNA, are well known to those skilled in the art. Various levels of stringency are 

35 accomplished by combining these various conditions as needed. In a preferred embodiment, 
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hybridization will occur at 30°C in 750 mM NaCL 75 mM trisodium citrate, and 1% SDS. In a 
more preferred embodiment, hybridization will occur at 37''C in 500 mM NaCK 50 mM trisodium 
citrate, 1% SDS, 35% formamide, and 100 jug/m\ denatured salmon sperm DNA (ssDNA). In a 
most preferred embodiment, hybridization will occur at 42°C in 250 mM NaCl, 25 mM trisodium 
citrate, 1% SDS, 50 % formamide, and 200 yug/ml ssDNA. Useful variations on these conditions 
will be readily apparent to those skilled in the art. 

The washing steps which follow hybridization can also vary in stringency. Wash 
stringency conditions can be defined by salt concentration and by temperature. As above, wash 
stringency can be increased by decreasing salt concentration or by increasing temperature. For 
example, stringent salt concentration for the wash steps will preferably be less than about 30 mM 
NaCl and 3 mM trisodium citrate, and most preferably less than about 15 mM NaCl and 1 .5 mM 
trisodium citrate. Stringent temperature conditions for the wash steps will ordinarily include 
temperature of at least about 25''C, more preferably of at least about 42°C, and most preferably of 
at least about eS^'C. In a preferred embodiment, wash steps will occur at 25*^C in 30 mM NaCl 3 
mM trisodium citrate, and 0.1% SDS, In a more preferred embodiment, wash steps will occur at 
42°C in 15 mM NaCI, 1 .5 mM trisodium citrate, and 0.1% SDS. In a most preferred embodiment, 
wash steps will occur at 6S^C in 15 mM NaCl, 1,5 mM trisodium citrate, and 0.1% SDS. 
Additional variations on these conditions will be readily apparent to those skilled in the art. 

Methods for DNA sequencing are well known in the art and may be used to practice any 
of the embodiments of the invention. The methods may employ such enzymes as the Klenow 
fragment of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase 
(Perkin-Eimer), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the 
ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 
preparation is automated with machines such as the HYDRA microdispenser (Robbins Scientific, 
Sunnyvale CA), MICROLAB 2200 (Hamilton, Reno NV), Peltier Thermal Cycler 200 (PTC200; 
MJ Research, Watenown MA) and the ABI CATALYST 800 (Perkin-Eimer). Sequencing is then 
carried out using either ABI 373 or 377 DNA Sequencing Systems (Perkin-Eimer) or the 
MEGABACE 1000 DNA sequencing system (Molecular Dynamics, Sunnyvale CA). The 
resulting sequences are analyzed using a variety of algorithms which are well known in the art. 
(See, e.g., Ausubel, P.M. (1997) Short Protocols in Molecular Biologv . John Wiley & Sons, New 
York NY, unit 7.7; Meyers, R.A. (1995) Molecular Biologv and Biotechnology . Wiley VCH, New 
York NY, pp. 856-853.) 

The nucleic acid sequences encoding CSIGP may be extended utilizing a partial 
nucleotide sequence and employing various PCR-based methods known in the art to detect 
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upstream sequences, such as promoters and regulatory elements. For example, one method which 
may be employed, restriction-site PCR, uses universal and nested primers to amplify unknown 
sequence from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods 
Applic. 2:318-322.) Another method, inverse PCR, uses primers that extend in divergent 
5 directions to amplify unknown sequence from a circularized template. The template is derived 
from restriction fragments comprising a known genomic locus and surrounding sequences. (See, 
e.g., Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186.) A third method, capture PCR, involves 
PCR amplification of DNA fragments adjacent to known sequences in human and yeast artificial 
chromosome DNA. (See, e.g., Lagerstrom, M, et al. (1991) PCR Methods Applic. 1:111-119.) In 

10 this method, multiple restriction enzyme digestions and ligations may be used to insert an 

engineered double-stranded sequence into a region of unknown sequence before performing PCR. 
Other methods which may be used to retrieve unknown sequences are known in the art. (See, e.g., 
Parker, J.D. et al. ( 1991) Nucleic Acids Res. 19:3055-306). Additionally, one may use PCR, 
nested primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic 

15 DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon - ; 
junctions. For all PCR-based methods, primers may be designed using commercially available:* 
software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, Plymouth MN)"or 
another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of 
about 50% or more, and to anneal to the template at temperatures of about 68''C to 72°C. 

20 When screening for full-length cDNAs, it is preferable to use libraries that have been - 

size-selected to include larger cDNAs. In addition, random-primed libraries, which often include^ 
sequences containing the 5' regions of genes, are preferable for situations in which an oiigo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of 
sequence into 5' non-transcribed regulatory regions. 

25 Capillary electrophoresis systems which are commercially available may be used to 

analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, 
capillary sequencing may employ flowable polymers for electrophoretic separation, four different 
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for 
detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal 

30 using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), 
and the entire process from loading of samples to computer analysis and electronic data display 
may be computer controlled. Capillary electrophoresis is especially preferable for sequencing 
small DNA fragments which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 

35 which encode CSIGP may be cloned in recombinant DNA molecules that direct expression of 
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CSIGP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the 
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the 
same or a functionally equivalent amino acid sequence may be produced and used to express 
CSIGP. 

5 The nucleotide sequences of the present invention can be engineered using methods 

generally known in the art in order to alter CSIGP-encoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or expression of the 
gene product. DNA shuffling by random fragmentation and PGR reassembly of gene fragments 
and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, 
10 oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create 
new restriction sites, alter glycosylation patterns, change codon preference, produce splice 
variants, and so forth. 

In another embodiment, sequences encoding CSIGP may be synthesized, in whole or in 
part, using chemical methods well known in the an. (See, e.g., Caruthers, M.H. et al. (1980) Nucl. 

15 Acids Res. Symp. Ser. 215-223, and Horn, T. et ai. (1980) Nucl. Acids Res. Symp. Ser. 225-232.) 
Alternatively, CSIGP itself or a fragment thereof may be synthesized using chemical methods. 
For example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g., 
Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using 
the ABI 43 1 A Peptide Synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of 

20 CSIGP, or any part thereof, may be altered during direct synthesis and/or combined with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g, Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392- 
421 .) The composition of the synthetic peptides may be confirmed by amino acid analysis or by 

25 sequencing. (See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties . WH 
Freeman, New York NY.) 

In order to express a biologically active CSIGP, the nucleotide sequences encoding 
CSIGP or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector 
which contains the necessary elements for transcriptional and translational control of the inserted 

30 coding sequence in a suitable host. These elements include regulatory sequences, such as 

enhancers, constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector 
and in polynucleotide sequences encoding CSIGP. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding CSIGP. Such signals include the ATG initiation codon and adjacent 

35 sequences, e.g. the Kozak sequence. In cases where sequences encoding CSIGP and its initiation 
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codon and upstream regulatory sequences are insened into the appropriate expression vector, no 
additional transcriptional or iranslational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including an in-frame ATG initiation codon should be provided by the vector. Exogenous 
5 translational elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the 
particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct 
to expression vectors containing sequences encoding CSIGP and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA techniques, 
synthetic techniques, and in.viyo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) 
Molecular Cloning. A Laboratory Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 
16-17; Ausubel, F.M. et al. ( 1 995) Current Protocols in Molecular Biology . John Wiley & Sons, 

1 5 New York NY, ch. 9, 1 3, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding CSIGP. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression 
vectors; yeast transformed with yeast expression vectors; insect cell systems infected with viral 

20 expression vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors 
(e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic virus. TMV) or with bacterial - ^ 
expression vectors (e.g., Ti or pBR322 plasmids); or animal cell systems. The invention is not 
limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected 

25 depending upon the use intended for polynucleotide sequences encoding CSIGP. For example, 
routine cloning, subcloning, and propagation of polynucleotide sequences encoding CSIGP can be 
achieved using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) 
or pSPORTl plasmid (Life Technologies). Ligation of sequences encoding CSIGP into the 
vector's multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure 

30 for identification of transformed bacteria containing recombinant molecules. In addition, these 
vectors may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with 
helper phage, and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. 
and S.M. Schuster ( 1989) J. Biol. Chem. 264:5503-5509.) When large quantities of CSIGP are 
needed, e.g. for the production of antibodies, vectors which direct high level expression of CSIGP 

35 may be used. For example, vectors containing the strong, inducible T5 or T7 bacteriophage 
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promoter may be used. 

Yeast expression systems may be used for production of CSIGP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, 
may be used in the yeast Saccharomvces cerevisiae or Pichia oastoris . In addition, such vectors 
5 direct either the secretion or intracellular retention of expressed proteins and enable integration of 
foreign sequences into the host genome for stable propagation. (See, e.g., AusubeL 1995, supra : 
Grant et al. (1987) Methods Enzymol. 153:516-54; and Scorer, C. A. et ah (1994) Bio/Technology 
12:181-184.) 

Plant systems may also be used for expression of CSIGP. Transcription of sequences 
10 encoding CSIGP may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) 
EMBO J. 6:307-31 1), Alternatively, plant promoters such as the small subunit of RUBISCO or 
heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; 
Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell 

15 Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA 

transformation or pathogen-mediated transfection. (See, e.g.. The McGraw Hill Yearbook of 
Science and Technology (1992^ McGraw Hill, New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding CSIGP may be ligated 

20 into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 
obtain infective virus which expresses CSIGP in host cells. (See, e.g., Logan, J. and T. Shenk 
(1984) Proc. Natl. Acad. Set. 81:3655-3659.) In addition, transcription enhancers, such as the 
Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. 

25 SV40 or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments 
of DNA than can be contained in and expressed from a piasmid. HACs of about 6 kb to 10 Mb 
are constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al, (1997) Nat Genet. 

30 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable 
expression of CSIGP in cell lines is preferred. For example, sequences encoding CSIGP can be 
transformed into cell lines using expression vectors which may contain viral origins of replication 
and/or endogenous expression elements and a selectable marker gene on the same or on a separate 
35 vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 
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days in enriched media before being switched to selective media. The purpose of the selectable 
marker is to confer resistance to a selective agent, and its presence allows growth and recovery of 
cells which successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be propagated using tissue culture techniques appropriate to the cell type. 

5 Any number of selection systems may be used to recover transformed cell lines. These 

include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tkr or apr cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 1 1:223-232; Lowy, 1. et al. (1980) Cell 22:817-823.) Also, antimetabolite, 
antibiotic, or herbicide resistance can be used as the basis for selection. For example, dhfr confers 

0 resistance to methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; 
and als or pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, 
respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-3570; 
* Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14) Additional selectable genes have been 
described, e.g., trpB and hisD, which alter cellular requirements for metabolites. (See, e.g., 

5 Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-8051.) Visible markers, 
e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), fl glucuronidase and its substrate 
6-gIucuronide, or luciferase and its substrate luciferin may be used. These markers can be used* 
not only to identify transform ants, but also to quantify the amount of transient or stable protein 
expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. (1995) Methods Mol. 

0 Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of . r' 
interest is also present, the presence and expression of the gene may need to be confirmed. For 
example, if the sequence encoding CSIGP is inserted within a marker gene sequence, transformed 
cells containing sequences encoding CSIGP can be identified by the absence of marker gene 

5 function. Alternatively, a marker gene can be placed in tandem with a sequence encoding CSIGP 
under the control of a single promoter. E.xpression of the marker gene in response to induction or 
selection usually indicates expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding CSIGP and that 
express CSIGP may be identified by a variety of procedures known to those of skill in the art. 

0 These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, 
solution, or chip based technologies for the detection and/or quantification of nucleic acid or 
protein sequences. 

Immunological methods for detecting and measuring the expression of CSIGP using either 
5 specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 



# 
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include enzyme- linked immunosorbent assays (ELISAs), radioimmunoassays (RJAs), and 
fluorescence activated cell sorting (FACS), A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non- interfering epitopes on CSIGP is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. 
5 (See, e.g., Hampton, R. et al. ( 1 990) Serological Methods, a Laboratory Manual . APS Press, St 
Paul MN, Sect, !V; Coligan, J. E. et al. (1997) Current Protocols in Immunolog v. Greene Pub. 
Associates and Wiley-lnterscience, New York NY; and Pound, J.D. (1998) Immunochemical 
Protocols .Humana Press, Totowa NJ). 

A wide variety of labels and conjugation techniques are known by those skilled in the art 

10 and may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PGR probes for detecting sequences related to polynucleotides encoding CSIGP 
include oiigolabeling, nick translation, end-labeling, or PCR amplification using a labeled 
nucleotide. Alternatively, the sequences encoding CSIGP, or any fragments thereof, may be 
cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are 

15 commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures 
may be conducted using a variety of commercially available kits, such as those provided by 
Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 
molecules or labels which may be used for ease of detection include radionuclides, enzymes, 

20 fluorescent, chemi luminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

Host ceils transformed with nucleotide sequences encoding CSIGP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the 

25 sequence and/or the vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides which encode CSIGP may be designed to contain signal 
sequences which direct secretion of CSIGP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications 

30 of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 

phosphorylation, lipidation, and acylation. Post-translationai processing which cleaves a "prepro" 
form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and W138), are available from 

35 the American Type Culture Collection (ATCC, Bethesda MD) and may be chosen to ensure the 
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correct modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding CSIGP may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric CSIGP protein 
5 containing a heterologous moiety that can be recognized by a commercially available antibody 
may facilitate the screening of peptide libraries for inhibitors of CSIGP activity. Heterologous 
protein and peptide moieties may also facilitate purification of fusion proteins using commercially 
available affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase 
(GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6- 

10 His, FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification 
of their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, 
calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable 
immunoaffinity purification of fusion proteins using commercially available monoclonal and 
polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be 

15 engineered to contain a proteolytic cleavage site located between the CSIGP encoding sequence^ 
and the heterologous protein sequence, so that CSIGP may be cleaved away from the heterologous 
moiety following purification. Methods for fusion protein expression and purification are 
discussed in Ausubel (1995, supra , ch 10). A variety of commercially available kits may also be 
used to facilitate expression and purification of fusion proteins. 

20 In a further embodiment of the invention, synthesis of radiolabeled CSIGP may be 

achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract systems 
(Promega). These systems couple transcription and translation of protein-coding sequences 
operably associated with the T7, T3, or SP6 promoters. Translation takes place in the presence of 
a radiolabeled amino acid precursor, preferably ^^S-methionine. 

25 Fragments of CSIGP may be produced not only by recombinant production, but also by 

direct peptide synthesis using solid-phase techniques, (See, e.g., Creighton, supra pp. 55-60.) 
Protein synthesis may be performed by manual techniques or by automation. Automated synthesis 
may be achieved, for example, using the ABI 43 1 A Peptide Synthesizer (Perkin-Elmer). Various 
fragments of CSIGP may be synthesized separately and then combined to produce the full length 

30 molecule. 



THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between CSIGP and cell signaling proteins. In addition, the expression of CSIGP is closely 
35 associated with cell proliferation and inflammatory disorders. Therefore, in cell proliferative and 
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inflammatory disorders where CSIGP is an inhibitor or suppressor of cell proliferation, it is 
desirable to increase the expression of CSIGP. In cell proliferative and inflammatory disorders 
where CSIGP is an activator or enhancer and is promoting cell proliferation, it is desirable to 
decrease the expression of CSIGP. 
5 Therefore, in one embodiment, CSIGP or a fragment or derivative thereof may be 

administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of CSIGP. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, 
mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 

10 polycythemia vera, psoriasis, primary thrombocythemia; cancers including adenocarcinoma, 

leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers 
of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; and an inflammatory 

15 disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult 

respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, 
contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, 
emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema 

20 nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, 
Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis. Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 

25 thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma. 

In another embodiment, a vector capable of expressing CSIGP or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
0 expression or activity of CSIGP including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially 
purified CSIGP in conjunction with a suitable pharmaceutical carrier may be administered to a 
subject to treat or prevent a disorder associated with decreased expression or activity of CSIGP 
including, but not limited to, those provided above. 
5 In still another embodiment, an agonist which modulates the activity of CSIGP may be 
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administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of CSIGP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of CSIGP may be administered to a subject to treat 
or prevent a disorder associated with increased expression or activity of CSIGP. Examples of 
5 such disorders include, but are not limited to, those described above. In one aspect, an antibody 
which specifically binds CSIGP may be used directly as an antagonist or indirectly as a targeting 
or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express 
CSIGP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
10 encoding CSIGP may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of CSIGP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, 
complementary sequences, or vectors of the invention may be administered in combination with 
other appropriate therapeutic agents. Selection of the appropriate agents for use in combination 
15 therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical - 
principles. The combination of therapeutic agents may act synergistically to effect the treatment 
or prevention of the various disorders described above. Using this approach, one may be able to^ - 
achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for - 
adverse side effects. 

20 An antagonist of CSIGP may be produced using methods which are generally known iri-'^ 

the art. In particular, purified CSIGP may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind CSIGP. Antibodies to CSIGP may 
also be generated using methods that are well known in the art. Such antibodies may include, but 
are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies. Fab fragments, 

25 and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which 
inhibit dimer formation) are especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, 
humans, and others may be immunized by injection with CSIGP or with any fragment or 
oligopeptide thereof which has immunogenic properties. Depending on the host species, various 

30 adjuvants may be used to increase immunological response. Such adjuvants include, but are not 
limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. 
Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum 
are especially preferable. 

^5 It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
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CSIGP have an amino acid sequence consisting of at least about 5 amino acids, and, more 
preferably, of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, 
or fragments are identical to a portion of the amino acid sequence of the natural protein and 
contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of 
5 CSIGP amino acids may be fused with those of another protein, such as FCLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to CSIGP may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are 
not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV- 
10 hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. 
(1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al, (1983) Proc. Natl. Acad. Sci. 
80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
15 antigen specificity and biological activity, can be used, (See, e.g., Morrison, S.L. et al. (1984) 
Proc. Natl. Acad. Sci. 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 3 12:604-608; and 
Takeda, S. et al. (1985) Nature 3 14:452-454.) Alternatively, techniques described for the 
production of single chain antibodies may be adapted, using methods known in the art, to produce 
CSIGP-specific single chain antibodies. Antibodies with related specificity, but of distinct 
20 idiotypic composition, may be generated by chain shuffling from random combinatorial 

immunoglobulin libraries. (See, e.g., Burton D.R. (1991) Proc. Natl. Acad. Sci. 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents 
as disclosed in the literature, (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. 86: 
25 3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for CSIGP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments 
produced by pepsin digestion of the antibody molecule and Fab fragments generated by reducing 
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries may be 
30 constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired 
specificity. (See, e.g., Huse, W.D. et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the 
desired specificity. Numerous protocols for competitive binding or immunoradiometric assays 
using either polyclonal or monoclonal antibodies with established specificities are well known in 
35 the art. Such immunoassays typically involve the measurement of complex formation between 
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CSIGP and its specific antibody. A two-site, monoclonal-based immunoassay utilizing 
. monoclonal antibodies reactive to two non-interfering CSIGP epitopes is preferred, but a 
competitive binding assay may also be employed (Pound, supra ). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
5 techniques may be used to assess the affinity of antibodies for ABBR. Affinity is expressed as an 
association constant, K^, which is defined as the molar concentration of ABBR-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium 
conditions. The Kg determined for a preparation of polyclonal antibodies, which are 
heterogeneous in their affinities for multiple ABBR epitopes, represents the average affinity, or 

10 avidity, of the antibodies for ABBR. The determined for a preparation of monoclonal 

antibodies, which are monospecific for a particular ABBR epitope, represents a true measure of 
affinity. High-affinity antibody preparations with ranging from about 10^ to 10'^ L/mole are 
preferred for use in immunoassays in which the ABBR-antibody complex must withstand rigorous 
manipulations. Low-affinity antibody preparations with ranging from about 10^ to 10^ L/mole 

15 are preferred for use in immunopurification and similar procedures which ultimately require - 
dissociation of ABBR, preferably in active form, from the antibody (Catty, D. (1988) Antibodies, 
Volume 1: A Practical Approach . IRL Press, Washington DC; Liddell, J. E. and Cryer, A. (1991) 
A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 

20 determine the quality and suitability of such preparations for certain downstream applications.- For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/mL is preferred for use in procedures requiring precipitation 
of ABBR-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, 
and guidelines for antibody quality and usage in various applications, are generally available. 

25 (See, e.g., Catty, supra , and Coligan et al. supra . > 

In another embodiment of the invention, the polynucleotides encoding CSIGP, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the 
complement of the polynucleotide encoding CSIGP may be used in situations in which it would be 
desirable to block the transcription of the mRNA. In particular, cells may be transformed with 

30 sequences complementary to polynucleotides encoding CSIGP. Thus, complementary molecules 
or fragments may be used to modulate CSIGP activity, or to achieve regulation of gene function. 
Such technology is now well known in the art, and sense or antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
encoding CSIGP. . ^ 

^5 Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, 
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or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the 
targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art 
can be used to construct vectors to express nucleic acid sequences complementary to the 
polynucleotides encoding CSIGP. (See, e.g., Sambrook, supra : Ausubel, 1995, supra .) 
5 Genes encoding CSIGP can be turned off by transforming a cell or tissue with expression 

vectors which express high levels of a polynucleotide, or fragment thereof, encoding CSIGP. 
Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. 
Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA 
molecules until they are disabled by endogenous nucleases. Transient expression may last for a 

10 month or more with a non-replicating vector, and may last even longer if appropriate replication 
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or 
regulatory regions of the gene encoding CSIGP. Oligonucleotides derived from the transcription 

15 initiation site, e.g., between about positions -10 and +10 from the start site, are preferred. 

Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix 
pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently 
for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA have been described in the literature, (See, e.g., Gee, J.E. et al. 

20 (1994) in Huber, B.E. and B.I. Carr, Molecular and Immunologic Approaches , Futura Publishing, 
Mt. Kisco NY, pp. 163-177.) A complementary sequence or antisense molecule may also be 
designed to block translation of mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage 
of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the 

25 ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For 
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding CSIGP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: 

30 GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 

ribonucleotides, corresponding to the region of the target gene containing the cleavage site, may 
be evaluated for secondary structural features which may render the oligonucleotide inoperable. 
The suitability of candidate targets may also be evaluated by testing accessibility to hybridization 
with complementary oligonucleotides using ribonuclease protection assays. 

35 Complementary ribonucleic acid molecules and ribozymes of the invention may be 
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prepared by any method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated bv in vitro 
and in vivo transcription of DNA sequences encoding CSIGP. Such DNA sequences may be 
5 incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or 
SP6. Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3* 
10 ends of the molecule, or the use of phosphorothioate or 2* O-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as 
inosine, queosine, and wybutosine, as well as 

acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thvmine, and 

1 5 uridine which are not as easily recognized by endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and equally 
suitable for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into 
stem cells taken from the patient and clonally propagated for autologous transplant back into that 
same patient. Delivery by transfection, by liposome injections, or by polycationic amino polymers 

20 may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et kl. 
(1997) Nature Biotechnology 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, 
monkeys, and most preferably, humans. 

25 An additional embodiment of the invention relates to the administration of a 

pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, 
for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist 
of CSIGP, antibodies to CSIGP, and mimetics, agonists, antagonists, or inhibitors of CSIGP. The 
compositions may be administered alone or in combination with at least one other agent, such as a 

30 stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical 
carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions 
may be administered to a patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 

35 intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
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enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain 
suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which 
facilitate processing of the active compounds into preparations which can be used 
5 pharmaceutically. Further details on techniques for formulation and administration may be found 
in the latest edition of Reminuton's Pharmaceutical Sciences (Maack Publishing, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 
10 pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by 
the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active 
compounds with solid excipient and processing the resultant mixture of granules (optionally, after 
grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable 

15 excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, 
mannitol, and sorbitol; starch from com, wheat, rice, potato, or other plants; cellulose, such as 
methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, 
including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, 

20 agar, and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated 
sugar solutions, which may also contain gum arable^ talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for 

25 product identification or to characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 

30 capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty 
oils, liquid, or liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution. 
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain 

35 substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
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cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be 
prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include 
fatty oils, such as sesame oiL or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or 
liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the 
5 suspension may also contain suitable stabilizers or agents to increase the solubility of the 
compounds and allow for the preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a 
10 manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and 
succinic acid. Salts tend to be more soluble in aqueous or other protonic solvents than are the 
15 corresponding free base forms. In other cases, the preferred preparation may be a iyophilized - 
powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1% to 2%v/*' 
sucrose, and 2% to 7% mannitoi, at a pH range of 4.5 to 5.5, that is combined with buffer prior to 
use. 

After pharmaceutical compositions have been prepared, they can be placed in an v 
20 appropriate container and labeled for treatment of an indicated condition. For administration of 
CSIGP, such labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions 
wherein the active ingredients are contained in an effective amount to achieve the intended 
purpose. The determination of an effective dose is well within the capability of those skilled in the 
25 art. 

For any compound, the therapeutically effective dose can be estimated initially either in 
cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, rabbits, dogs, or 
pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes 

30 for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
CSIGP or fragments thereof, antibodies of CSIGP, and agonists, antagonists or inhibitors of 
CSIGP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, 

35 such as by calculating the ED,o (the dose therapeutically effective in 50% of the population) or 
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LD50 (the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic 
effects is the therapeutic index, and it can be expressed as the LD50/ ED50 ratio. Pharmaceutical 
compositions which exhibit large therapeutic indices are preferred. The data obtained from cell 
culture assays and animal studies are used to formulate a range of dosage for human use. The 
5 dosage contained in such compositions is preferably within a range of circulating concentrations 
that includes the EDj^ with Httle or no toxicit>'. The dosage varies within this range depending 
upon the dosage form employed, the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of 

10 the active moiety or to maintain the desired effect. Factors which may be taken into account 
include the severity of the disease state, the general health of the subject, the age, weight, and 
gender of the subject, time and frequency of administration, drug combination(s), reaction 
sensitivities, and response to therapy. Long-acting pharmaceutical compositions may be 
administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance 

1 5 rate of the particular formulation. 

Normal dosage amounts may vary from about 0. \ /ugio \ 00,000 A^g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 

20 inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind CSIGP may be used for the 
diagnosis of cell proliferative and inflammatory disorders characterized by expression of CSIGP, 
25 or in assays to monitor patients being treated with CSIGP or agonists, antagonists, or inhibitors of 
CSIGP. Antibodies useful for diagnostic purposes may be prepared in the same manner as 
described above for therapeutics. Diagnostic assays for CSIGP include methods which utilize the 
antibody and a label to detect CSIGP in human body fluids or in extracts of cells or tissues. The 
antibodies may be used with or without modification, and may be labeled by covaient or non- 
30 covalent attachment of a reporter molecule. A wide variety of reporter molecules, several of which 
are described above, are known in the art and may be used. 

A variety of protocols for measuring CSIGP, including ELISAs, RIAs, and FACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of CSIGP 
expression. Normal or standard values for CSIGP expression are established by combining body 
35 fluids or cell extracts taken from normal mammalian subjects, preferably human, with antibody to 
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CSIGP under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, preferably by photometric means. Quantities of 
CSIGP expressed in subject, control, and disease samples from biopsied tissues are compared with 
the standard values. Deviation between standard and subject values establishes the parameters for 
5 diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding CSIGP may be 
used for diagnostic purposes. The polynucleotides which may be used include oligonucleotide 
sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be 
used to detect and quantitate gene expression in biopsied tissues in which expression of CSIGP 
0 may be correlated with disease. The diagnostic assay may be used to determine absence, 
presence, and excess expression of CSIGP, and to monitor regulation of CSIGP levels during 
therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding CSIGP or closely related 
5 molecules may be used to identify nucleic acid sequences which encode CSIGP. The specificity 
of the probe, whether it is made from a highly specific region, e.g., the 5' regulatory region, or **> 
from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or 
amplification (maximal, high, intermediate, or low), will determine whether the probe identifies ' 
only naturally occurring sequences encoding CSIGP, allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and should preferably 
have at least 50% sequence identity to any of the CSIGP encoding sequences. The hybridization 
probes of the subject invention may be DNA or RNA and may be derived from the sequence of 
SEQ ID NO: 14-26 or from genomic sequences including promoters, enhancers, and introns of the 
CSIGP gene. 

Means for producing specific hybridization probes for DNAs encoding CSIGP include the 
cloning of polynucleotide sequences encoding CSIGP or CSIGP derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as ^-P or ^^S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding CSIGP may be used for the diagnosis of cell 
proliferative and inflammatory disorders associated with expression of CSIGP. Examples of such 
disorders include, but are not limited to, a disorder of cell proliferation such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
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(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 
primary thrombocythemia; cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, 
bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, 
5 liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
testis, thymus, thyroid, and uterus; and an inflam.matory disorder such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 
allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune 
hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, contact dermatitis, Crohn's 

10 disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia 
v^ith iymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout. Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 

15 syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 
bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma. The polynucleotide 
sequences encoding CSIGP may be used in Southern or northern analysis, dot blot, or other 

20 membrane-based technologies; in PGR technologies; in dipstick, pin, and ELISA assays; and in 
microarrays utilizing fluids or tissues from patients to detect altered CSIGP expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding CSIGP may be useful in assays 
that detect the presence of associated disorders, particularly those mentioned above. The 

25 nucleotide sequences encoding CSIGP may be labeled by standard methods and added to a fluid or 
tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantitated 
and compared with a standard value. If the amount of signal in the patient sample is significantly 
altered in comparison to a control sample then the presence of altered levels of nucleotide 

30 sequences encoding CSIGP in the sample indicates the presence of the associated disorder. Such 
assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in 
animal studies, in clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
CSIGP, a normal or standard profile for expression is established. This may be accomplished by 

35 combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
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sequence, or a fragment thereof, encoding CSIGP, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially 
purified polynucleotide is used. Standard values obtained in this manner may be compared with 
5 values obtained from samples from patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in 
the patient begins to approximate that which is observed in the normal subject. The results 
10 obtained from successive assays may be used to show the efficacy of treatment over a period 
ranging from several days to months. 

With respect to cancer, the presence of a relatively high amount of transcript in biopsied 
tissue from an individual may indicate a predisposition for the development of the disease, or may 
provide a means for detecting the disease prior to the appearance of actual clinical symptoms. A 

15 more definitive diagnosis of this type may allow health professionals to employ preventative 
measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
CSIGP may involve the use of PGR. These oligomers may be chemically synthesized, generated 

20 enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a 
polynucleotide encoding CSIGP, or a fragment of a polynucleotide complementary to the 
polynucleotide encoding CSIGP, and will be employed under optimized conditions for 
identification of a specific gene or condition. Oligomers may also be employed under less 
stringent conditions for detection or quantitation of closely related DNA or RNA sequences. 

25 Methods which may also be used to quantitate the expression of CSIGP include 

radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and 
interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. 
Methods 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 229-236.) The speed of 
quantitation of multiple samples may be accelerated by running the assay in an ELISA format 

30 where the oligomer of interest is presented in various dilutions and a spectrophotometric or 
colorimetric response gives rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as targets in a microarray. The 
microarray can be used to monitor the expression level of large numbers of genes simultaneously 

35 and to identify genetic variants, mutations, and polymorphisms. This information may be used to 
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determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, and 
to develop and monitor the activities of therapeutic agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, 
e.g., Brennan, T.M, et ai. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. 
5 Acad. Sci. 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/2511 16; Shalon, 
D. et a!. (1995) PCT application WO95/35505; Heller R.A. et al. (1997) Proc. Natl. Acad. Sci. 
94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding CSIGP may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic 
10 sequence. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial 
PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) 
Nat Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends 
15 Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical 
chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in 
Meyers, supra, pp. 965-968.) Examples of genetic map data can be found in various scientific 
journals or at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between the 

20 location of the gene encoding CSIGP on a physical chromosomal map and a specific disorder, or a 
predisposition to a specific disorder, may help define the region of DNA associated with that 
disorder. The nucleotide sequences of the invention may be used to detect differences in gene 
sequences among normal, carrier, and affected individuals. 

iTLsitu hybridization of chromosomal preparations and physical mapping techniques, such 

25 as linkage analysis using established chromosomal markers, may be used for extending genetic 
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 
mouse, may reveal associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to chromosomal arms by physical 
mapping. This provides valuable information to investigators searching for disease genes using 

30 positional cloning or other gene discovery techniques. Once the disease or syndrome has been 
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 
I Iq22-23, any sequences mapping to that area may represent associated or regulatory genes for 
further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide 
sequence of the subject invention may also be used to detect differences in the chromosomal 

35 location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 
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In another embodiment of the invention, CSIGP, its catalytic or immunogenic fragments, 
or oligopeptides thereof can be used for screening libraries of compounds in any of a variety of 
drug screening techniques. The fragment employed in such screening may be free in solution, 
affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of 
5 binding complexes between CSIGP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest, (See, e.g., Geysen, et al. 
(1984) PCT application WO84/03564.) In this method, large numbers of different small test 
compounds are synthesized on a solid substrate. The test compounds are reacted with CSIGP, or 
10 fragments thereof, and washed. Bound CSIGP is then detected by methods well known in the art. 
Purified CSIGP can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which 
15 neutralizing antibodies capable of binding CSIGP specifically compete with a test compound for 
binding CSIGP. In this manner, antibodies can be used to detect the presence of any peptide 
which shares one or more antigenic determinants with CSIGP. - 'X" 

In additional embodiments, the nucleotide sequences which encode CSIGP may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely 
20 on properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. ^' 

Without further elaboration, it is believed that one skilled in the art can, using the 
preceding description, utilize the present invention to its fullest extent. The following preferred 
specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of 
25 the remainder of the disclosure in any was whatsoever. 

The entire disclosure of all applications, patents, and publications, cited above and below, 
and of US provisional applications 60/085,343 (filed May 13, 1998), and 60/098,010 (filed 
August 26, 1998) are hereby incorporated by reference. 

EXAMPLES 

30 1. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were 
homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life 
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting 
35 iysates were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated 
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from the lysates with either isopropanol or sodium acetate and ethanol, or by other routine 
methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was 
5 isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 
(QIAGEN, Valencia CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding 
10 cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), 
using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 
1997, supra, units 5.1-6.6). Reverse transcription was initiated using oligo d(T) or random 
primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA 

15 was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA 
was size-selected (300-1000 bp) using SEPHACRYL SIOOO, SEPHAROSE CL2B, or 
SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative 
agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the 
polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), pSPORTl plasmid 

20 (Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo Alto CA). Recombinant plasmids 
were transformed into competent E. coli cells including XL 1 -Blue, XLl-BlueMRF, or SOLR from 
Stratagene or DHSa, DHl OB, or ElectroMAX DH lOB from Life Technologies. 
II. Isolation of cDNA Clones 

Plasmids were recovered from host cells by in vivo excision, using the UNIZAP vector 

25 system (Stratagene) or cell lysis. Plasmids were purified using at least one of the following: a 
Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep 
purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 
Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the REAL Prep 96 plasmid kit 
from QIAGEN. Following precipitation, plasmids were resuspended in O.l ml of distilled water 

30 and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PGR in 
a high-throughput format (Rao, V.B. ( 1 994) Anal. Biochem. 216:1-14). Host cell lysis and 
thermal cycling steps were carried out in a single reaction mixture. Samples were processed and 
stored in 384-weil plates, and the concentration of amplified plasmid DNA was quantified 

35 fluorometrically using PICOGREEN dye (Molecular Probes, Eugene OR) and a Fluoroskan 11 
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fluorescence scanner (Labsystems Oy, Helsinki, Finland), 
in. Sequencing and Analysis 

The cDNAs were prepared for sequencing using either an ABI CATALYST 800 (Perkin- 
Elmer) or a HYDRA microdispenser (Robbins) or MICROLAB 2200 (Hamilton) sequencing 
5 preparation system in combination with PTC-200 thermal cyclers (MJ Research). The cDNAs 
were sequenced using the ABI PRISM 373 or 377 sequencing systems of the MEGABACE 1000 
DNA sequencing system (Molecular Dynamics) and ABI protocols, base calling software, and kits 
(Perkin-Elmer). Alternatively, solutions and dyes from Amersham Pharmacia Biotech were used. 
Reading frames were determined using standard methods (Ausubel, 1997, supra ). Some of the 

10 cDNA sequences were selected for extension using the techniques disclosed in Example V. 

The polynucleotide sequences derived from cDNA, extension, and shotgun sequencing 
were assembled and analyzed using a combination of software programs which utilize algorithms 
well known to those skilled in the art. Table 5 summarizes the software programs, descriptions, 
references, and threshold parameters used. The first column of Table 5 shows the tools, programs, 

15 and algorithms used, the second column provides a brief description thereof, the third column-^^ 
presents the references which are incorporated by reference herein, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the probability the greater the 
homology). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software>^ 

20 Engineering, S. San Francisco CA) and LASERGENE software (DNASTAR). 

cDNAs were also compared to sequences in GenBank using a search algorithm developed 
by Applied Biosystems and incorporated into the INHERIT'" 670 sequence analysis system. In 
this algorithm. Pattern Specification Language (TRW Inc, Los Angeles, CA) was used to 
determine regions of homology. The three parameters that determine how the sequence 

25 comparisons run were window size, window offset, and error tolerance. Using a combination of 
these three parameters, the DNA database was searched for sequences containing regions of 
homology to the query sequence, and the appropriate sequences were scored with an initial value. 
Subsequently, these homologous regions were examined using dot matrix homology plots to 
distinguish regions of homology from chance matches. Smith- Waterman alignments were used to 

30 display the results of the homology search. 

Peptide and protein sequence homologies were ascertained using the INHERIT- 670 
sequence analysis system using the methods similar to those used in DNA sequence homologies. 
Pattern Specification Language and parameter windows were used to search protein databases for 
sequences containing regions of homology which were scored with an initial value. Dot-matrix 

35 homology plots were examined to distinguish regions of significant homology from chance 
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matches. 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences v^ere then 
5 queried against a selection of public databases such as GenBank primate, rodent, mammalian, 
vertebrate, and eukaryote databases, and BLOCKS to acquire annotation, using programs based 
on BLAST, FASTA, and BLIMPS. The sequences were assembled into full length polynucleotide 
sequences using programs based on Phred, Phrap, and Consed, and were screened for open 
reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
10 polynucleotide sequences were translated to derive the corresponding full length amino acid 
sequences, and these full length sequences were subsequently analyzed by querying against 
databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
PFAM, and Prosite. 

The programs described above for the assembly and analysis of full length polynucleotide 
15 and amino acid sequences were also used to identify polynucleotide sequence fragments from 
SEQ ID NO: 14-26. Fragments from about 20 to about 4000 nucleotides which are useful in 
hybridization and amplification technologies were described in The Invention section above. 
rV. Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
20 gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
RNAs from a particular cell type or tissue have been bound. (See, e.g., Sam brook, supra , ch. 7; 
Ausubel, 1995, supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or 
related molecules in nucleotide databases such as GenBank or LIFESEQ database (Incyte 
25 Pharmaceuticals). This analysis is much faster than multiple membrane-based hybridizations. In 
addition, the sensitivity of the computer search can be modified to determine whether any 
particular match is categorized as exact or similar. The basis of the search is the product score, 
which is defined as: % sequence identity x % maximum BLAST score 

ICQ 

30 The product score takes into account both the degree of similarity between two sequences 

and the length of the sequence match. For example, with a product score of 40, the match will be 
exact within a 1% to 2% error, and, with a product score of 70, the match will be exact. Similar 
molecules are usually identified by selecting those which show product scores between 15 and 40, 
although lower scores may identify related molecules. 

35 The results of northern analyses are reported a percentage distribution of libraries in which 
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the transcript encoding CSIGP occurred. Analysis involved the categorization of cDNA libraries 
by organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletaL nervous, 
reproductive, and urologic. The disease or condition categories included cancer, 
inflammation/trauma, cell proliferation, neurological, and pooled. For each category, the number 
of libraries expressing the sequence of interest was counted and divided by the total number of 
libraries across all categories. Percentage values of tissue-specific and disease expression are 
reported in Table 3. 

V. Extension of CSIGP Encoding Polynucleotides 

The full length nucleic acid sequence of SEQ ID NO: 1 4-26 was produced by extension of 
an appropriate fragment of the full length molecule using oligonucleotide primers designed from 
this fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the 
other primer, to initiate 3' extension of the known fragment. The initial primers were designed 
using OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 
22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the 
target sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which^ 
would result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 



High fidelity amplification was obtained by PGR using methods well known in the art. " 
PGR was performed in 96- well plates using the PTC-200 thermal cycler (MJ Research, Inc.). Tfie 
reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg-^, 
(NH4)2S04, and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), 
ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the 
following parameters for primer pair PCI A and PGI B: Step I: 94°G, 3 min; Step 2: 94 15 sec; 
Step 3: 60°G, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 
68°C, 5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK-H 
were as follows: Step 1: 94^C, 3 min; Step 2: 94°G, 15 sec; Step 3: 57X, I min; Step 4: 68X, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 ^l 
PICOGREEN quantitation reagent (0.25% (v/v) PIGOGREEN; Molecular Probes, Eugene OR) 
dissolved in IX TE and 0.5 (il of undiluted PGR product into each well of an opaque fluorimeter 
plate (Goming Gostar, Acton MA), allowing the DNA to bind to the reagent. The plate was 
scanned in a Fluoroskan II (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the 
sample and to quantify the concentration of DNA. A 5 /^l to \0 ju\ aliquot of the reaction mixture 
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was analyzed by electrophoresis on a 1 % agarose mini-gel to determine which reactions were 
successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-weli plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
5 sonicated or sheared prior to religation into pUC 1 8 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector 
(Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in 

10 restriction site overhangs, and transfected into competent E. coH cells. Transformed cells were 
selected on antibiotic-containing media, individual colonies were picked and cultured overnight at 
Sl'^C in 384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PGR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 

15 parameters: Step I: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60X, 1 min; Step 4: 2 min; 

Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72 °C, 5 min; Step 7: storage at 4X. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low 
DNA recoveries were reamplified using the same conditions as described above. Samples were 
diluted with 20% dimethysulphoxide (1 :2, v/v), and sequenced using DYENAMIC energy transfer 

20 sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABl 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Eimer). 

In like manner, the nucleotide sequence of SEQ ID NO: 14-26 is used to obtain 5* 
regulatory sequences using the procedure above, oligonucleotides designed for such extension, 
and an appropriate genomic library. 

25 VI. Choice, Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO: 14-26 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 
base pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 

30 software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 juCi of 
[y-^-P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10' counts per minute of the labeled probe is used in a typical membrane- 

35 based hybridization analysis of human genomic DNA digested with one of the following 
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endonucleases: Ase I, Bgl II, Eco Rl, Pst I, Xbal, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gei and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schueli, Durham NH). Hybridization is carried out for 16 
hours at 40 ''C. To remove nonspecific signals, blots are sequentially washed at room temperature 
5 under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5% sodium dodecyl 
sulfate. After XOMAT-AR film (Eastman Kodak, Rochester NY) is exposed to the blots to film 
for several hours, hybridization patterns are compared visually. 

VII. Microarrays 

A chemical coupling procedure and an Inkjet device can be used to synthesize array 
10 elements on the surface of a substrate. (See, e.g., Baldeschweiier, supra .) An array analogous to a 
dot or slot blot may also be used to arrange and link elements to the surface of a substrate using 
thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by 
hand or using available methods and machines and contain any appropriate number of elements. 
After hybridization, nonhybridized probes are removed and a scanner used to determine the levels 
15 and patterns of fluorescence. The degree of complementarity and the relative abundance of each 
probe which hybridizes to an element on the microarray may be assessed through analysis of the 
scanned images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may 
comprise the elements of the microarray. Fragments suitable for hybridization can be selected' 

20 using software well known in the art such as LASERGENE software (DNASTAR). Full-length 
cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of the 
present invention, or selected at random from a cDNA library relevant to the present invention, are 
arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., 
UV cross-linking followed by thermal and chemical treatments and subsequent drying. (See, e.g., 

25 Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) 
Fluorescent probes are prepared and used for hybridization to the elements on the substrate. The 
substrate is analyzed by procedures described above. 

VIII. Complementary Polynucleotides 

Sequences complementary to the CSIGP-encoding sequences, or any parts thereof, are 
30 used to detect, decrease, or inhibit expression of naturally occurring CSIGP. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides 
are designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of 
CSIGP. To inhibit transcription, a complementary oligonucleotide is designed from the most 
35 unique 5' sequence and used to prevent promoter binding to the coding sequence. To inhibit 
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translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the 
CSIGP-encoding transcript 
IX^ Expression of CSIGP 

Expression and purification of CSIGP is achieved using bacterial or virus-based 
5 expression systems. For expression of CSIGP in bacteria, cDNA is subcioned into an appropriate 
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels 
of cDNA transcription. Examples of such promoters include, but are not limited to, the trp-lac 
{tac) hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the iac 
operator regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, 

10 e.g., BL21(DE3). Antibiotic resistant bacteria express CSIGP upon induction with isopropyl beta- 
D-thiogalactopyranoside (IPTG). Expression of CSIGP in eukaryotic ceils is achieved by 
infecting insect or mammalian cell lines with recombinant Autoeraohica califomica nuclear 
polyhedrosis virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin 
gene of baculovirus is replaced with cDNA encoding CSIGP by either homologous recombination 

15 or bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is 
maintained and the strong polyhedrin promoter drives high levels of cDNA transcription. 
Recombinant baculovirus is used to infect Spodoptera frugioerda (Sf9) insect cells in most cases, 
or human hepatocytes, in some cases. Infection of the latter requires additional genetic 
modifications to baculovirus. (See Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. USA 

20 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) 

In most expression systems, CSIGP is synthesized as a fusion protein with, e.g., 
glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, 
single-step, affinity-based purification of recombinant fusion protein from crude cell lysates. 
GST, a 26-kilodalton enzyme from Schistosoma iaponicum . enables the purification of fusion 

25 proteins on immobilized glutathione under conditions that maintain protein activity and 

antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be 
proteolytically cleaved from CSIGP at specifically engineered sites. FLAG, an 8-amino acid 
peptide, enables immunoaffinity purification using commercially available monoclonal and 
polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a stretch of six consecutive histidine 

30 residues, enables purification on metal-cheiate resins (QIAGEN). Methods for protein expression 
and purification are discussed in Ausubel (1995, supra , ch 10 and 16). Purified CSIGP obtained 
by these methods can be used directly in the following activity assay. 
X. Demonstration of CSIGP Activity 

CSIGP activity can be assayed m vitro by monitoring the mobilization of Ca^ as part of 

35 the signal transduction pathway. (See, e.g., Grynkievwicz, G. et al. (1985) J. Biol. Chem. 
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260:3440; McColI, S, et al. ( 1 993) J. Immunol. 1 50:4550-4555; and Aussel, C. et aL ( 1 988) supra ) 
The assay requires preloading neutrophils or T ceils with a fluorescent dye such as FURA-2 or 
BCECF (Universal Imaging Corp, Westchester PA) whose emission characteristics have been 
altered by Ca^ binding. When the cells are exposed to one or more activating stimuli artificially 
5 (ie, anti-CD3 antibody ligation of the T cell receptor) or physiologically (ie, by allogeneic 

stimulation), Ca^ flux takes place. This flux can be observed and quantified by assaying the cells 
in a fiuorometer or fluorescent activated cell sorter. Measurements of Ca"^ flux are compared 
between cells in their normal state and those preloaded with CSIGP. 

Protein kinase activity in CSIGP is determined by measuring the phosphorylation of a 

10 protein substrate using gamma- labeled ^^P-ATP and quantitation of the incorporated radioactivity 
using a radioisotope counter. CSIGP is incubated with the protein substrate, ^-P-ATP, and an 
appropriate kinase buffer. The ^-P incorporated into the product is separated from free ^-P-ATP by 
electrophoresis and the incorporated is counted. The amount of ^-P recovered is proportional 
to the activity of CSIGP in the assay. A determination of the specific amino acid residue 

15 phosphorylated is made by phosphoamino acid analysis of the hydrolyzed protein. 

Protein phosphatase (PP) activity in CSIGP is determined by measuring the hydrolysis of 
P-nitrophenyl phosphate (PNPP). CSIGP is incubated together with PNPP in HEPES buffer pH 
7.5, in the presence of 0. 1% b-mercaptoethanol at 37°C for 60 min. The reaction is stopped by the 
addition of 6 ml of 10 N NaOH and the increase in light absorbance at 410 nm resulting from the 

20 hydrolysis of PNPP is measured using a spectrophotometer. The increase in light absorbance is 
proportional to the activity of CSIGP in the assay. 
XI. Production of CSIGP Specific Antibodies 

CSIGP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymoi. 182:488-495), or other purification techniques, is 

25 used to immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the CSIGP amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are 

30 well described in the art. (See, e.g., Ausubel, 1995, supra , ch. II.) 

Typically, oligopeptides 15 residues in length are synthesized using an ABI 43 lA 
Peptide Synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich, 
St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to 
increase immunogenicity. (See, e.g.. AusubeL 1995, supra .) Rabbits are immunized with the 

35 oligopeptide-ICLH complex in complete Freund's adjuvant. Resulting antisera are tested for 
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antipeptide activity by, for example, binding the peptide to plastic, blocking with 1% BSA, 
reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 
XII. Purification of Naturally Occurring CSIGP Using Specific Antibodies 

Naturally occurring or recombinant CSIGP is substantially purified by immunoaffinity 
chromatography using antibodies specific for CSIGP. An immunoaffinity column is constructed 
by covalently coupling anti-CSIGP antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing CSIGP are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of CSIGP (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/CSIGP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, 
such as urea or thiocyanate ion), and CSIGP is collected. 
XIIL Identification of Molecules Which Interact with CSIGP 

CSIGP, or biologically active fragments thereof, are labeled with '-^I Bohon-Hunter 
reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate molecules previously 
arrayed in the wells of a multi-well plate are incubated with the labeled CSIGP, washed, and any 
wells with labeled CSIGP complex are assayed. Data obtained using different concentrations of 
CSIGP are used to calculate values for the number, affinity, and association of CSIGP with the 
candidate molecules. 

Various modifications and variations of the described methods and systems of the 
invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. Although the invention has been described in connection with specific preferred 
embodiments, it should be understood that the invention as claimed should not be unduly limited 
to such specific embodiments. Indeed, various modifications of the described modes for carrying 
out the invention which are obvious to those skilled in molecular biology or related fields are 
intended to be within the scope of the following claims. 
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What is claimed is: 



1 . A substantially purified polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-13, or a fragment thereof. 
5 2, A substantially purified variant having at least 90% amino acid identity to the 

amino acid sequence of claim L 

3. An isolated and purified polynucleotide encoding the polypeptide of claim 1. 

4. An isolated and purified polynucleotide variant having at least 70% 
polynucleotide sequence identity to the polynucleotide of claim 3. 

10 5. An isolated and purified polynucleotide which hybridizes under stringent 

conditions to the polynucleotide of claim 3, 

6. An isolated and purified polynucleotide having a sequence which is 
complementary to the polynucleotide of claim 3. 

7. A method for detecting a polynucleotide, the method comprising the steps of: 
*5 (a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid in 

the sample, thereby forming a hybridization complex; and 

(b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of the polynucleotide in the sample. 

8. The method of claim 7 further comprising amplifying the polynucleotide prior to 
20 hybridization. 

9. An isolated and purified polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of SEQ ID NO: 14-26, or a fragment thereof 

10. An isolated and purified polynucleotide variant having at least 70% 
polynucleotide sequence identity to the polynucleotide of claim 9. 

25 11. An isolated and purified polynucleotide having a sequence which is 

complementary to the polynucleotide of claim 9. 

12. An expression vector comprising at least a fragment of the polynucleotide of 

claim 3. 

13. A host cell comprising the expression vector of claim 12. 

30 14. A method for producing a polypeptide, the method comprising the steps of 

a) culturing the host cell of claim 13 under conditions suitable for the 
expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

15. A pharmaceutical composition comprising the polypeptide of claim 1 in 
35 conjunction with a suitable pharmaceutical carrier. 
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A purified antibody which specifically binds to the polypeptide of claim I 
A purified agonist of the polypeptide of claim 1 . 



A purified antagonist of the polypeptide of claim 1 . 

A method for treating or preventing a disorder associated with decreased 



5 expression of CSIGP, the method comprising administering to a subject in need of such treatment 
an effective amount of the pharmaceutical composition of claim 15. 

20. A method for treating or preventing a disorder associated with increased 
expression of CSIGP, the method comprising administering to a subject in need of such treatment 
an effective amount of the antagonist of claim 18. 



10 
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SEQUENCE LISTING 



<110> INCYTE PHARMACEUTICALS, INC. 
BANDMAN, Oiga 
HILLMAN, Jennifer L. 
LAL, Preeti 
YUE, Henry 
TANG, Y. Tom 
PATTERSON, Chandra 
BAUGHN, Mariah R. 
YANG, Junming 

<120> CELL SIGNALING PROTEINS 

<130> PF-0521 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/085,3434; 60/098,010 
<151> 1998-05-13; 1998-08-26 

<160> 26 

<170> PERL Program 

<210> 1 

<211> 418 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 016108 

<400> 1 



Met 


Ser 


Leu 


Leu 


Asp 


Cys 


Phe 


Cys 


Thr 


Ser 


Arg 


Thr 


Gin 


Val 


Glu 


1 








5 










10 










15 


Ser 


Leu 


Arg 


Pro 


Glu 


Lys 


Gin 


Ser 


Glu 


Thr 


Ser 


He 


His 


Gin 


Tyr 










20 










25 










30 


Leu 


Vai 


Asp 


Glu 


Pro 


Thr 


Leu 


Ser 


Trp 


Ser 


Arg 


Pro 


Ser 


Thr 


Arg 










35 










40 










45 


Ala 


Ser 


Glu 


Val 


Leu 


Cys 


Ser 


Thr 


Asn 


Val 


Ser 


His 


Tyr 


Glu 


Leu 










50 










55 










60 


Gin 


Val 


Glu 


He 


Gly 


Arg 


Gly 


Phe 


Asp 


Asn 


Leu 


Thr 


Ser 


Val 


His 










65 










70 










75 


Leu 


Ala 


Arg 


His 


Thr 


Pro 


Thr 


Gly 


Thr 


Leu 


Val 


Thr 


He 


Lys 


He 










80 










85 








90 


Thr 


Asn 


Leu 


Glu 


Asn 


Cys 


Asn 


Glu 


Glu 


Arg 


Leu 


Lys 


Ala 


Leu 


Gin 










95 










100 








105 


Lys 


Ala 


Val 


He 


Leu 


Ser 


His 


Phe 


Phe 


Arg 


His 


Pro 


Asn 


He 


Thr 










110 










115 










120 


Thr 


Tyr 


Trp 


Thr 


Val 


Phe 


Thr 


Val 


Gly 


Ser 


Trp 


Leu 


Trp 


Val 


He 










125 










130 










135 


Ser 


Pro 


Phe 


Met 


Ala 


Tyr 


Gly 


Ser 


Ala 


Ser 


Gin 


Leu 


Leu 


Arg 


Thr 










140 










145 










150 


Tyr 


Phe 


Pro 


Glu 


Gly 


Met 


Ser 


Glu 


Thr 


Leu 


He 


Arg 


Asn 


He 


Leu 










155 










160 










165 


Phe 


Gly 


Ala 


Val 


Arg 


Gly 


Leu 


Asn 


Tyr 


Leu 


His 


Gin 


Asn 


Gly 


Cys 










170 










175 










180 


He 


His 


Arg 


Ser 


He 


Lys 


Ala 


Ser 


His 


He 


Leu 


He 


Ser 


Gly 


Asp 










185 










190 










195 


Giy 


Leu 


Val 


Thr 


Leu 


Ser 


Gly 


Leu 


Ser 


His 


Leu 


His 


Ser 


Leu 


Val 
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200 








Lys 


His 


Gly 


Gin 


Arg 
215 


His 


Arg 


Ala 


Ser 


Thr 


Ser 


Val 


Gin 
230 


Pro 


Trp 


Leu 


Asp 


Leu 


His 


Gly 


Leu 
245 


Tyr 


Val 


Lys 


He 


Thr 


Ala 


Cys 


Glu 
2 60 


Leu 


Ala 


Ser 


Met 


His 


Arg 


Thr 


Gin 


Met 


Leu 


Leu 








275 








Tyr 


Ser 


Pro 


Leu 


Asp 
290 


He 


Ser 


He 


Met 


Lys 


Asn 


Ser 


Gin 
305 


Ser 


Gly 


Val 


Val 


Leu 


Val 


Ser 


Ser 
320 


Gly 


Thr 


His 


His 


Thr 


Pro 


Ser 


Ser 
335 


Lys 


Thr 


Phe 


Val 


Gin 


Leu 


Cys 


Leu 
350 


Gin 


Gin 


Asp 


Ser 


Ser 


Leu 


Leu 


Ser 
365 


His 


Val 


Phe 


Ser 


Gin 


Asp 


Ser 


He 
380 


Leu 


Ser 


Leu 


Pro 


Ser 


He 


Ser 


Leu 
395 


Pro 


Pro 


Val 


Cys 


Asp 


Phe 


Pro 


Asp 
410 


Glu 


Lys 


Asp 





205 










210 


Val 


Tyr 
220 


Asp 


Phe 


Pro 


Gin 


Phe 
225 


Ser 


Pro 

O c 

235 


Glu 


Leu 


Leu 


Arg 


Gin 
240 


Ser 


Asp 
o c o 


He 


Tyr 


Ser 


Val 


Gly 

o c c 


Gly 


Gin 


Val 


Pro 


Phe 


Gin 


Asp 


O C Q 

^ bo 










O T O 


Gin 


Lys 

o Q n 
^oU 


Leu 


Lys 


Gly 


Pro 


Pro 
o o c 


Phe 


Pro 

O Q C 


Gin 


Ser 


Glu 


Ser 


Arg 

o n A 


Asp 


Ser 


Gly 


He 


Gly 


Glu 


Ser 
Jib 


Thr 


Val 
325 


Asn 


Ser 


Asp 


Arg 


Leu 
330 


Ser 


Pro 
34 0 


Ala 


Phe 


Phe 


Ser 


Leu 
345 


Pro 


Glu 
355 


Lys 


Arg 


Pro 


Ser 


Ala 
360 


Phe 


Lys 
370 


Gin 


Met 


Lys 


Glu 


Glu 
375 


Leu 


Pro 
385 


Pro 


Ala 


Tyr 


Asn 


Lys 
390 


Leu 


Pro 
400 


Trp 


Thr 


Glu 


Pro 


Glu 
405 


Ser 


Tyr 
415 


Trp 


Glu 


Phe 







<210> 2 
<211> 540 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc-feature 

<223> Incyte Clone 640521 

<400> 2 



Met 


Ser 


Ser 


Cys 


Val 


Ser 


Ser 


Gin 


Pro 


Ser 


Ser 


Asn 


Arg 


Ala 


Ala 


1 






5 










10 










15 


Pro 


Gin 


Asp 


Glu 


Leu 


Gly 


Gly 


Arg 


Gly 


Ser 


Ser 


Ser 


Ser 


Glu 


Ser 










20 










25 










30 


Gin 


Lys 


Pro 


Cys 


Glu 


Ala 


Leu 


Arg 


Gly 


Leu 


Ser 


Ser 


Leu 


Ser 


He 










35 










40 










45 


His 


Leu 


Gly 


Met 


Glu 


Ser 


Phe 


He 


Val 


Val 


Thr 


Glu 


Cys 


Glu 


Pro 








50 










55 










60 


Gly 


Cys 


Ala 


Val 


Asp 


Leu 


Gly 


Leu 


Ala 


Arg 


Asp 


Arg 


Pro 


Leu 


Glu 










65 










70 










75 


Ala 


Asp 


Gly 


Gin 


Glu 


Val 


Pro 


Leu 


Asp 


Ser 


Ser 


Gly 


Ser 


Gin 


Ala 








80 










85 










90 


Arg 


Pro 


His 


Leu 


Ser 


Gly 


Arg 


Lys 


Leu 


Ser 


Leu 


Gin 


Glu 


Arg 


Ser 










95 










100 










105 


Gin 


Gly 


Gly 


Leu 


Ala 


Ala 


Gly 


Gly 


Ser 


Leu 


Asp 


Met 


Asn 


Gly Arg 










110 










115 










120 


Cys 


He 


Cys 


Pro 


Ser 


Leu 


Pro 


Tyr 


Ser 


Pro 


Val 


Ser 


Ser 


Pro 


Gin 










125 










.130 










135 


Ser 


Ser 


Pro 


Arg 


Leu 


Pro 


Arg 


Arg 


Pro 


Thr 


Val 


Glu 


Ser 


His 


His 










140 










145 










150 


Val 


Ser 


He 


Thr 


Gly 


Met 


Gin 


Asp 


Cys 


Val 


Gin 


Leu 


Asn 


Gin 


Tyr 
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155 



160 



165 



Thr 


Leu 


Lys 


Asp 


/—lit 

G±U 


lie 


Gly 


Lys 


xr 

kjxy 


oer 


Tyr 


(jxy 


V ax 


V a X 


Lys 










1 / u 










X / o 










X o u 


Leu 


A J. a 


Tyr 


Asn 




Asn 


Asp . 


Asn 


i nr 


Tyr 


Tyr 


rixa 




Lys 


V dX 










i 0 D 










X w 










1 95 


Leu 


Ser 


Lys 


Lys 


Lys 


Leu 


X xe 


Arg 


Pin 
(aX n 




(Z1 X7 

ox y 


IT lltr 


It X (_) 


rg 


Arg 










9 n n 
z u u 










zu o 










^ X u 


Pro 


Pro 


Pro 


Arg 


vjX y 


i nr 


Arg 


Pro 


a 1 a 
r\Xa 


Pro 


oxy 


ox y 


y ^ 


He 


Gin 










^ -L -) 










220 










225 


Piro 


Arg 


ox y 


Pro 


X X ^ 


oX u 


oX 11 


V d X 


i yr 


r:T n 

VjX 11 


VjX Li 


X x t; 


Ala 


He 


Leu 
















-J — ' 










240 


Lys 


Lys 


Leu 


Asp 


rlXS 


Pro 


Asn 


V tl X 


V dX 


Lys 


Leu 


V d X 


O X Ll 


V d X 


Leu 






O A ^ 
Z D 










z o \j 










2 55 


Asp 


Asp 


Pro 


Asn 


oXU 


Asp 


nx S 


Leu 


Tyr 


LYlci L 


V ax 


Pho 

It Uc 


O X li 


Leu 


Va 1 
V d X 
















ZOO 










9 7 n 


Asn 


Gin 


(jXy 


Pro 


V ax 


Met 


VjlU 


V ax 


Pro 


i nr 


Leu 


Lys 


Pro 


Leu 


c o r~ 
OCX 








O "7 Q 










o Q n 
z oU 










9 ft S 
Z O ^ 


Glu 


Asp 


Gin 


Ala 


Arg 


Phe 


Tyr 


Phe 


Lixn 


Asp 


Leu 


xxe 


Lys 


oxy 


T 1 Q 

xxe 










o Q n 
z y u 










O Q ^ 

z y o 










_> u u 


Glu 


Tyr 


Leu 


His 


Tyr 


Gin 


Lys 


Tin 

He 


Tin 

lie 


His 


Arg 


Asp 


1 xe 


Lys 


Pro 










'3 n Q 










Tin 
J lU 










^ X o 


Ser 


Asn 


Leu 


Leu 


Val 


Gly 


Glu 


Asp 


Gly 


nlS 


Tin 

lie 


Lys 


i le 


Axa 


Asp 










Ton 
Jz U 










"5 O 
-3 z O 










O J u 


Phe 


Gly 


Val 


Ser 


Asn 


Glu 


Phe 


Lys 


Gly 


ber 


Asp 


Ala 


Leu 


Leu 


Ser 








'3 T C 
J JO 










J4 U 










J H 0 


Asn 


Thr 


Val 


Gly 


Thr 


Pro 


Ala 


Phe 


Met 


AX a 


Pro 


LslU 


oe r 


Leu 


Q i=i -r- 

oe r 








•D O 










"5 

J D O 










T n 

O 0 u 


Glu 


Thr 


Arg 


Lys 


lie 


Phe 


Ser 


Gly 


Lys 


Ala 


Leu 


Asp 


val 


Trp 


a 1 o 
AX a 










ez c 










"3 T n 












Met 


Gly 


Val 


Thr 


Leu 


Tyr 


Cys 


Phe 


Val 


Phe 


Gly 


Gin 


Cys 


Pro 


Phe 








380 










"3 Q C 










-3 y u 


Met 


Asp 


Glu 


Arg 


lie 


Met 


Cys 


Leu 


His 


Ser 


Lys 


He 


Lys 


Ser 


(jin 










395 










4 UU 










4 u o 


Ala 


Leu 


Glu 


Phe 


Pro 


Asp 


Gin 


Pro 


Asp 


He 


Ala 


Glu 


Asp 


Leu 


Lys 










410 










4 lo 










/ion 
4 Z U 


Asp 


Leu 


lie 


Thr 


Arg 


Met 


Leu 


Asp 


Lys 


Asn 


Pro 


Giu 


Ser 


Arg 


lie 










4 z 0 










4 -3U 










4 

4 J O 


Val 


Val 


Pro 


Glu 


lie 


Lys 


Leu 


His 


Pro 


Trp 


val 


Thr 


Arg 


rix S 


Lixy 










4 4 U 










4 4 D 










4 0 u 


Ala 


Glu 


Pro 


Leu 


Pro 


Ser 


Glu 


Asp 


/"111 

Glu 


Asn 


Cys 


i nr 


Leu 


V ax 


LjXU 










4 55 










4 60 










465 


Val 


Thr 


Glu 


Glu 


Glu 


' Vai 


Glu 


Asn 


Ser 


Val 


Lys 


His 


He 


Pro 


Ser 










470 










475 










480 


Leu 


Ala 


Thr 


Val 


He 


Leu 


Val 


Lys 


Thr 


Met 


lie 


Arg 


Lys 


Arg 


Ser 










485 










490 










495 


Phe 


Gly 


Asn 


Pro 


Phe 


Glu 


Gly 


Ser 


Arg 


Arg 


Glu 


Glu 


Arg 


Ser 


Leu 










500 










505 










510 


Ser 


Ala 


Pro 


Gly 


Asn 


Leu 


Leu 


Thr 


Lys 


Gin 


Gly 


Ser 


Glu 


Asp 


Asn 








515 










520 










525 


Leu 


Gin 


Gly Thr 


Asp 


Pro 


Pro 


Pro 


Val 


Gly 


Glu 


Glu 


Glu 


Val 


Leu 



<210> 3 

<211> 729 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc-f eature 

<223> Incyte Clone 1250171 



530 



535 



540 
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<400> 3 

Met Gin Ser Thr Ser 
1 5 
Gly Gin Gly Ala Thr 
20 

Thr Gly Asp Leu Phe 
35 

Leu Arg Pro Val Asp 
50 

Lys Leu Asn His Lys 
65 

Glu Thr Thr Thr Arg 
80 

Cys Gly Ser Leu Tyr 
95 

Gly Leu Pro Glu Ser 
110 

Gly Gly Met Asn His 
125 

He Lys Pro Gly Asn 
140 

Ser Val Tyr Lys Leu 
155 

Asp Asp Glu Gin Phe 
170 

His Pro Asp Met Tyr 
185 

Lys Lys Tyr Gly Ala 
200 

Phe Tyr His Ala Ala 
215 

Gly Pro Arg Arg Asn 
230 

Lys Pro Ser Gly Ala 
245 

Pro He Asp Trp Ser 
260 

Arg Gly Leu Gin Val 
275 

Glu Ala Asp Gin Glu 
290 

Glu Thr Ser Asp He 
305 

Leu Gin Gin Met Thr 
320 

Thr Ala Thr He Phe 
335 

He Ser Ser Asn Gin 
350 

Leu Glu Pro Gly Arg 
365 

Glu Asn Pro He Phe 
380 

Gly Leu He Tyr Glu 
395 

Tyr AsD Leu Asp Gly 
410 

Val Val Cys Tyr Ala 
425 

Gin Glu Leu Met Arg 
440 

Lys Asp AsD Tyr Asn 
455 

He Thr Leu Asp Phe 



As n 


nx o 


Leu 


I rp 


Leu 
10 


a 


a ici^ 


Ua 1 
V ax 




a t~ rr 

25 




•L X 


T 

Xty o 


Val 


40 




v3X 11 




Arg 


Glu 
55 


As n 


He 


Val 


— y- 


Leu 
70 




T \7« 

jjy o 


V ax 


-U CJ 


T 1 ^ 

X X C7 

85 




V ci X 


T on 


Glu 


Glu 
100 


ox (J 




Leu 


He 


Val 
115 


Leu 


Arg 


OX li 


Asn 


Gly 
130 


J. xe 


weT- 


Arg 


Val 


He 
145 


i nr 


Asp 


Ph £i 

irne 


Gly Ala 










160 


V a± 


Ser 


Leu 


Tyr 


Gly 
175 


/—Til 


Arg 


AX a 


Val 


Leu 
190 


i nr 


V ax 


Asp 


Leu 


Trp 

o r\ c 


i nr 


t:iXy 


oer 


Leu 


Pro 

o o n 


Lys 


CjXU 


vax 


Met 


Tyr 


ixe 


c ^ -K- 

ber 


uxy 


Val 


Gin 
250 


(jXy 


Asp 


Met 


Pro 


Val 
265 


Leu 


Leu 


i nr 


Pro 


Val 
280 


Lys 


Cys 


Trp 


Gly 


Phe 
295 


Leu 


LI -1 C! 

n X s 


Arg 


Met 


Val 
310 


riXa 


tlx S 


Lys 


He 


Tyr 
325 


oX s 


*jX U 


Leu 


Val 


Tyr 
340 


r 1 n 
*jX u 


Leu 


X X fc; 


Tyr 


Glu 
355 




r-VX ct 


Gin 


His 


Phe 
370 


Val 


Val 


Ser 


Arg 


Glu 
385 


Lys 


He 


Ser 


Leu 


Pro 








400 


Asp 


Ala 


Ser 


Met 


Ala 
415 


Cys 


Arg 


He 


Ala 


Ser 
430 


Lys 


Gly 


He 


Arg 


Trp 
445 


Glu 


Thr 


Val 


His 


Lys 
460 


Cys 


He Arg 


Asn 


He 



Leu 


Ser 


Asp 


X X ^ 


Leu 
15 


VI X y 


rtx y 


nx o 


T. V ^ 


Xiy o 

30 


As n 


As n 


He 


Ser 


Phe 
45 


Phe 


Glu 


Val 


Leu 


Lys 
60 


Phe 


A.la 


He 


Glu 


Glu 
75 


Met 


Glu 


Phe 


Cys 


Pro 
90 


P T O 
IT X W 


C Q r" 

O CSX 


As n 


Ala 


105 


Ton 
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120 


Asp 


val 


Gin 



























<210> 14 

<211> 1957 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 016108 



<400> 14 

atttttgtca 

acttcaagaa 

caatacttgg 

gtactatgtt 

gacaacttga 

aaaattacaa 

attctatccc 

ggcagctggc 

aggacctatt 

gtgagagggt 

catatcctca 

ttggttaagc 

gtgcagccgt 

tcagatattt 

caggacatgc 

ttggatatca 

gactctggga 

cgattacaca 

tgtttgcaac 

ttcaaacaga 

aacaagccat 

cctgatgaaa 



ctttctgtgt 
cacaagttga 
ttgatgagcc 
ccaccaacgt 
cttctgtcca 
atctggaaaa 
actttttccg 
tttgggttat 
tccctgaagg 
tgaactatct 
tttctggtga 
atggacagag 
ggttgagtcc 
acagtgttgg 
atagaactca 
gtattttccc 
ttggagaaag 
caccatcctc 
aagatcctga 
tgaaagaaga 
caatatcatt 
aagactcata 



gaactaaagt 
atcactcaga 
aaccctttcc 
ttctcactat 
tcttgcacgg 
ctgcaatgaa 
gcatcccaat 
ttctccattt 
aatgagtgaa 
gcaccaaaat 
tggcctagtg 
gcatagggct 
agaactactg 
gatcacagca 
gatgctgtta 
tcaatcagaa 
tgtgcttgtc 
aaaaactttc 
gaaaaggcca 
aagccaggat 
gcctccagtg 
ctgggaattc 



gattcaatgt 
cctgaaaaac 
tggtcacgtc 
gagctccaag 
catactccca 
gaacgcctga 
attacaactt 
atggcctatg 
actttaataa 
ggctgtattc 
accctctctg 
gtgtatgatt 
agacaggatt 
tgtgaattag 
cagaaactga 
tccagaatga 
tccagtggaa 
tctcctgcct 
tcagcaagca 
tcaatacttt 
ttaccttgga 
tagggctgcc 



ctcttttgga 
agtctgaaac 
catccactag 
tagaaatagg 
caggaacact 
aagctttaca 
attggacagt 
gttcagcaag 
gaaacattct 
acaggagtat 
gcctgtccca 
tcccacagtt 
tacatgggtt 
ccagtgggca 
aaggtcctcc 
aaaattccca 
ctcacacagt 
tctttagctt 
gtttattgtc 
cactgttgcc 
ctgagccaga 
aaatcatttt 



ttgcttctgt 
cagtatccat 
agccagtgaa 
aagaggattt 
ggtaactata 
gaaagccgtg 
tttcactgtt 
tcaactcttg 
ctttggagcc 
taaagccagc 
tctgcatagt 
cagcacatca 
atatgtgaag 
ggtgcctttc 
ttatagccca 
gtcaggtgta 
aaatagtgac 
ggtacagctc 
ccatgttttc 
tcctgcttat 
atgtgatttt 
atgtcctata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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tacttgacac 
tatacttgaa 
aggggcacca 
aattactaat 
tttttatctc 
tgggaatggg 
tttttaatga 
gtagagggaa 
actgagcctc 
ttuttttaaa 
attcacgcac 



tttctccttg 
aatacagttg 
gtttgtagtc 
ctagctagtg 
tcaatataat 
ccctcagagg 
gctattgtta 
aatgtttgtg 
actcacatta 
aaagggggct 
cccgtgggag 



ctgctttttc 
gtgcactgga 
cctctgtttc 
tcatttattc 
ttttgagcca 
acagtgcttc 
aaccaacagg 
ctttcccttt 
aatgattcac 
aaag"taacac 
ctcaataaag 



ttctgtattt 
gaatctatta 
gcacagagta 
tggaattttt 
gttaattttt 
caagtacatc 
ctagtttatc 
ttcttctgtt 
ttgaaatata 
tttcctactt 
atttact 



ctaggtacaa 
tttaaaacca 
ctatgacaag 
ttctaagctg 
ttcagtattt 
ttctcccaga 
ttacatcaga 
aatacttatg 
tacagaaatt 
atgtaaatta 



ataccagaat 
ctctgttcaa 
gaaacatcag 
tgactaactc 
tgctgtccct 
ttctctggcc 
cccttttctg 
gtaacaccta 
gtaatttgct 
tagatcctaa 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1957 



<210> 15 

<211> 2545 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc-f eature 

<223> Incyte Clone 640521 



<400> 15 

gagccgagct gggggcgcag agcgcgggag 
ctgcagtgca gtggtgcgat ctcggctcag 
tgattctcct gcctcagcct cccgagtagc 
ggtggctccg ctgccggatg ggagtgcccc 
atgtcatcat gtgtctctag ccagcccagc 
gggggcaggg gcagcagcag cagcgaaagc 
tcatccttga gcatccacct gggcatggag 
ggctgtgctg tggacctcgg cttggcgcgg 
gtcccccttg actcctccgg gtcccaggcc 
ctgcaagagc ggtcccaggg tgggctggca 
tgcatctgcc cgtccctgcc ctactcaccc 
ccccggcggc cgacagtgga gtctcaccac 
cagctgaatc agtataccct gaaggatgaa 
ttggcctaca atgaaaatga caatacctac 
ctgatccggc aggccggctt tccacgtcgc 
ggaggctgca tccagcccag gggccccatt 
aagaagctgg accaccccaa tgtggtgaag 
gaccatctgt acatggtgtt cgaactggtc 
ctcaaaccac tcrctgaaga ccaggcccgt 
gagtacttac actaccagaa gatcatccac 
ggagaagatg ggcacatcaa gatcgcrgac 
gacgcgctcc tctccaacac cgtgggcacg 
gagacccgca agatcttctc tgggaaggcc 
tactgctttg tctttggcca gtgcccattc 
aagatcaaga gtcaggccct ggaatttcca 
gacctgatca cccgtatgct ggacaagaac 
aagctgcacc cctgggtcac gaggcatggg 
tgcacgctgg tcgaagtgac tgaagaggag 
ttggcaaccg tgatcctggt gaagaccatg 
gagggcagcc ggcgggagga acgctcactg 
ggcagcgaag acaacctcca gggcaccgac 
ttgtgagagg cagtccctgc gtggaaagtt 
gcatgcatcc actgcggccg gaggaggcca 
acctcgcatg cgcgccgcgt cgcctctggg 
tgtcctacgg aaacccagca cgtgtgtaga 
tttcctttgt tgttttaaag gggacaaaaa 
tcgaccgtgg ccgctggctg gctggacagg 
cgtgcatttt gggacaattg ctttttaaaa 
ttttcagaac cacgtcagat ataccaagtg 
ggcgagcaga aaactccggc ggtctgaggc 
gtagggggct agatgtggct cctagtgcaa 



gcggcggcgg cgcggaccca gtcacccagg 60 
tcattgcaac crtcacctcc cggattcaag 120 
tgggattaca ggtgcccacc accatgccca 180 
agtgtgctgg atgaagctgg cgcatgcacc 240 
agcaaccggg ccgcccccca ggatgagctg 300 
cagaagccct gtgaggccct gcggggcctc 360 
tccttcattg tggtcaccga gtgtgagccg 420 
gaccggcccc tggaggccga tggccaagag 480 
cggccccacc tctccggtcg caagctgtct 540 
gccggtggca gcctggacat gaacggacgc 600 
gtcagctccc cgcagtcctc gcctcggctg 660 
gtctccatca cgggtatgca ggactgtgtg 720 
attggaaagg gctcctatgg tgtcgtcaag 780 
tatgcaatga aggtgctgtc caaaaagaag 840 
cctccacccc gaggcacccg gccagctcct 900 
gagcaggtgt accaggaaat tgccatcctc 960 
ctggtggagg tcctggatga ccccaatgag 1020 
aaccaagggc ccgtgatgga agtgcccacc 1080 
ttctacttcc aggatctgat caaaggcatc 1140 
cgtgacatca aaccttccaa cctcctggtc 1200 
tttggtgtga gcaatgaatt caagggcagt 1260 
cccgccttca tggcacccga gtcgctctct 1320 
ttggatgttt gggctatggg tgtgacacta 1380 
atggacgagc ggatcatgtg tttacacagt 1440 
gaccagcccg acatagctga ggacttgaag 1500 
cccgagtcga ggatcgtggt gccggaaatc 1560 
gcggagccgt tgccgtcgga ggatgagaac 1620 
gtcgagaact cagtcaaaca cattcccagc 1680 
atacgtaaac gctcctttgg gaacccattc 1740 
tcagcgcctg gaaacttgct cacgaagcaa 1800 
ccgccccccg tgggggagga ggaagtgctc 18 60 
gctgggcccc cgcccccggc tcccccgcac 1920 
tggagcccga gtagctgcct ggatcgctcg 1980 
gggctgctgc accgcgtttc catagcagca 2040 
gcctcgatcg tcatctctgg ttatttgttt 2100 
aaaaaaaaaa aaggacttga ctccatgacg 2160 
cgggtgtgag gagttgcaga cccaaaccca 2220 
cgtttttatg ccaaaaatcc ttcattgtga 2280 
actgtgtgtg gggtttgaca actgtggaaa 2340 
catggaggtg gttgctgcat ttgagaggga 24 00 
accggaaacc atggcacctt ccagagccgt 24 60 
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ggtctcaagg agtcagagca gggctggccc tcagtagctg cagggagctt tgatggcaac 2520 
ttattttgtt aagaagggtt ttttt 2545 



<210> 16 
<211> 3034 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc-feature 

<223> Incyte Clone 1250171 



<400> 16 

tcctgagtct 

gtataacaag 

ttttatctga 

aaactggtga 

atgttcaaan 

tatttgctat 

catgtgggag 

ctgaattctt 

gtatagtgca- 

agtctgtgna 

ttgrttctcr 

taagaaaaga 

cattttacca 

ataaagaagt 

agaaagcaga 

ctcggggtct 

aaaagtgttg 

taattcatgt 

atacngctac 

aagaacttat 

tccctaaaac 

taggattaat 

gggatgctag 

gtaccttact 

ttaaagatga 

tctgtatcag 

tggaagcggc 

gttctcaggg 

gatcacrggc 

aaaaacnaca 

aagcagaacg 

tgtattacca 

aggcattttt 

tattatcgct 

atactaatga 

aacataccat 

tgaagaaatt 

ttttagaaag 

agctttctaa 

ttaaatgaat 

tgtaaatatg 

gatgtaattt 

gagaagaaag 

taaataagtt 

catccttaca 

tgtgagtggg 

ataaagtggc 

taatttcaga 

atttttttcc 

gttttaagct 



cgaggaggcc 
aggattgcct 
tattttaggc 
tttatttgct 
gagagaattt 
tgaagaggag 
tttatacact 
aattgttttg 
ccgtgatatc 
caaactcaca 
gtatggcaca 
tcatcagaag 
tgcagctact 
gatgtataaa 
aaatggacca 
tcaggttcta 
gggttttgac 
tttttcgcta 
tatatttcat 
ctacgaaggg 
tactgaggaa 
atatgaaaaa 
catggctaag 
gctttatcag 
ttacaatgaa 
aaacattgaa 
agagctaggt 
aacaacagaa 
agacgcatgg 
agrcctgtta 
tagattagct 
tgccacaaaa 
gaataagtca 
gactaatcag 
gttacaagaa 
gaccccaatt 
aaaggaagag 
gtttggctct 
tagaagttta 
gcctttatag 
tacaatattg 
tatcttttaa 
ccatgaccga 
attttctctg 
aatcaggaag 
gagcagaacc 
attattttat 
tgaattatta 
tttggccata 
gtatatttct 



gcgggagccc 
gatccagcca 
caaggagcta 
atcaaagtat 
gaagtgttga 
acaacaacaa 
gttttagaag 
cgagatgtgg 
aagccaggaa 
gattttggtg 
gaagaatatt 
aaatatggag 
ggatcactgc 
ataattacag 
attgactgga 
cttacccctg 
cagttttttg 
caacaaatga 
gaactggtat 
cgacgcttag 
aaccctatat 
atttccctcc 
gcaataacag 
gaattaatgc 
actgttcaca 
aaaactgtga 
gaaatttcag 
accagtcttc 
gcacatcaag 
aattgcatga 
tataatgaag 
gctatgacgc 
gaagaatgga 
tgttttgata 
actctgcctc 
tatccaagtt 
atggaagggg 
ttaaccatgg 
agaaaagttt 
atagtcactt 
taaatacata 
catttataat 
ccaatatgtt 
accgcctact 
actgacttga 
gcaccactgt 
tttacaaggt 
agcaaacatt 
aatgtgtaat 
ttaattctgc 



gccggcggtg 
agatgcagag 
ctgcaaacgt 
ttaataacat 
aaaaactcaa 
gacataaagt 
aaccttctaa 
tgggtggaat 
atatcatgcg 
cagccagaga 
tgcaccctga 
caacagttga 
catttagacc 
gaaagccttc 
gtggagacat 
ttcttgcaaa 
cagaaactag 
cagctcataa 
ataaacaaac 
tcttagaacc 
ttgtagtaag 
cnaaagtaca 
gggttgtgtg 
gaaaggggat 
aaaagacaga 
aagtatatga 
acacacacac 
aggatatcga 
aaggcactica 
cagagattta 
aacaaatcca 
actrtacaga 
taagaaagat 
ttgaagaaga 
agaaaatgtt 
ctaacacatt 
tggttaaaga 
atggtggcct 
ccgtttgcac 
gtttctacaa 
aaaaatatac 
tatatgagga 
gacatactga 
ggaaatattt 
cacgtttgta 
tatactggga 
gcccagatcc 
ttaaagtgaa 
tgtcattaaa 
ttactatttc 



gcgcggcgga 
cacttctaat 
ctttcgtgga 
aagcttcctt 
tcacaaaaat 
acttattatg 
tgcctatgga 
gaatcatcta 
tgttataggg 
attagaagat 
tatgtatgag 
tctttggagc 
ctttgaaggg 
tggtgcaata 
gcctgtttct 
catccttgaa 
tgatatactt 
gatttatata. 
caaaattatt 
tggaaggctg 
ccgggaacct 
tccacgttat 
ttatgcctgc 
acgatggctg 
agttgtgatc 
aaagttgatg 
caaattgttg 
cagcagatta 
tccgaaagac 
ctatcagttc 
caaatttgat 
tgaatgtgtt 
gcttcatctt 
agtatcaaaa 
tacagcttcc 
agtagaaatg 
acttgctgaa 
tcgcaacgtt 
aagaaaataa 
ttcagtattt 
aaatttttgg 
aatttgacct 
tcctctactc 
ttaagtggaa 
aatggtagaa 
taacaatttt 
cagttatcct 
ttcattatta 
attctaaggt 
atggaaaaaa 



gacccggctg 
catctgtggc 
agacataaga 
cgtccagtgg 
attgtcaaat 
gaattttgtc 
ctaccagaat 
cgagagaatg 
gaagatggac 
gatgagcagr 
agagcagtgc 
attggggtaa 
cctcgtagga 
tctggagtac 
tgcagtcttt 
gcagatcagg 
caccgaatgg 
catagctata 
tcttcaaatc 
gcacaacatt 
ctgaatacca 
gatttagacg 
agaattgcca 
attgaattaa 
acattggatt 
aagatcaacc 
agactttcca 
tctccaggtg 
agaaatgtag 
aaaaaagaca 
aagcaaaaac 
aaaaagtatg 
aggaaacagt 
tatcaagaat 
agtggaatca 
actcttggta 
aataaccaca 
gactgtcttt 
cgcttgggca 
gatgtggtcg 
ctgctgtgaa 
cagtgatcac 
tgagtggggc 
ccaaaatagg 
cggtggctac 
tttgagaagg 
tgtatccatg 
aaaactattc 
catttcaact 
ataaatttct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 ^ 
840-^ 
900.- 
960- 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
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caattttaaa aaattttttt ataaaaaaaa aaaa 



3034 



<210> 17 

<211> 1337 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-f eature 

<223> Incyte Clone 1911587 



<400> 17 

gaaagctgtg 

tgtggccggt 

agaggacggg 

ggcagaggct 

ccagacccga 

gactctgagc 

acaacatcct 

tattggatgt 

tgaagaagaa 

actgcgtggt 

atgattcaga 

gcaggatcct 

gcttctcagg 

tggatgcatt 

tcagtcaagc 

tctccatgga 

tagaagattc 

aaattcacca 

gttgtgccgc 

gggcctatgt 

tgctggaatg 

actgatcttc 

gggtggaggg 



ggaccatcct 
gggggcicctq 
ctttaggctg 
ccgacaggga 
gcccccgacc 
atcggaaagc 
gaatcaggcc 
ccgttccaaa 
aaataatgaa 
gtatgataac 
ctctgatggt 
gacccgcctc 
cacgtaccac 
tcagccatac 
ctgtgacccc 
tacagggccc 
cccggaagcc 
tcaccttggc 
catcatagcc 
caagaagtgc 
ggagaagact 
tccgaggccc 
ccagagt 



ggcaaccccg 
cggtcggagt 
gaagcgcctt 
gcctggccat 
tcagccaaac 
aggatgcctg 
acaaaactct 
tgggagtatg 
tatcttctcc 
aacagcagca 
ganggcaaag 
acccaccacc 
tttctccgga 
cccattgaaa 
aagattcaga 
ttttttgcag 
cagattcttc 
tctgtcattc 
tacctcatgc 
aaaaacaaca 
atccttggag 
accgaagggt 



gtgtttggct 
gggagggcca 
agaggagcca 
agtcgcgcac 
ccattccttc 
gtttgctttt 
ccagattaac 
acgaaagcca 
cggagtctgt 
ccctggagat 
atcttgtgcc 
ccgtctacat 
cccagaagat 
tcgtgccagg 
aggacttgaa 
gcgatgctga 
ccttcttacg 
tgatcttttc 
atagtaacga 
tgtgtccaaa 
attccatcac 
actgaagagc 



gggttctagc 
gtctgcaccc 
tttttccagg 
caggggaggt 
tgtccttgga 
atgtgaacca 
agaccccaac 
tgtgatcact 
ggacctggag 
actcttaaaa 
tcaagcagcc 
cctgaaaggg 
catctggatg 
gaaggtcttc 
aatcaaagcc 
caggcttctg 
ccacatgtgt 
cacccagggt 
gcagaccttg 
tcggggattg 
aaacatcatg 
ctcacctggg 



gtaccgtctg 
aagaggtgga 
tggggcccca 
ggagcgcgtc 
ggccagaggg 
acagagcttt 
tatctctgtt 
gcccttcgag 
tgtgtgaagt 
gatgatgatg 
attgagtatg 
ggctatgagc 
cctcaggaac 
gttggcaatt 
catgtcaatg 
cacatccgga 
cacttcattg 
atcagccgca 
cagaggtcct 
gtgagccagc 
gatccgctct 
ggcattttgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1337 



<210> 18 

<211> 1639 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-f eature 

<223> Incyte Clone 2079081 



<400> 18 

gacaaaagcc 

gtatacaaga 

tctgttcttc 

gcaagcaagc 

gcctgggcct 

attgaattgg 

aaatcacttg 

caaattaagg 

cttgtagaaa 

ctgacaaaat 

gcgggttatt 

ctggagctgg 

catatttctc 

agcaagattg 

acagaaaaca 

atggataaag 



agacacattt 
acctaaagga 
agcaaagggc 
tgagcaaagc 
cagagggaat 
aagcaataaa 
acaatgaagt 
ccaagaagaa 
gctccaagca 
caactgaaaa 
ctaccagact 
agaaggaaag 
tttttggcca 
acattgaaaa 
aatctgagtt 
agagacgaaa 



caacatgagg 
gttttctcaa 
aaacctggaa 
actacagaac 
gaaatccaca 
accgacttat 
tgaaaagaca 
attaatggtt 
atctatgact 
gttggaaaag 
gaaatgggaa 
aattcaactt 
aaccctgacc 
agatatccag 
cctgttaacg 
gtctttacta 



gacccactga 
aatggagaga 
attagctatg 
acgagaaaaa 
gcggacctgc 
caagtcctaa 
gcaaatcttg 
agtaccaaga 
gagaaggaga 
gaagatgaaa 
aacacactag 
ttatgcaata 
acatgccaca 
gctgtaatgg 
gattactttg 
aaaccaaaat 



cagattgtcc 
atttctgcaa 
ccaaaggact 
gttgtgttag 
atcaaaaact 
atgtacaaga 
tcattagcaa 
aacatgaagc 
agcggaagct 
attactacca 
agaactgcta 
acttaaacca 
cgcagattca 
aagaaactgc 
aagaagatcc 
tattgagact 



gtataataaa 
acaggtcaca 
tcagaaactg 
cagtgcctgg 
tggcaaagca 
gaagaagaga 
ctggaatcag 
acttttccag 
cctcaataaa 
aaaaaacatg 
ccagagcatt 
gtacagccaa 
ctgtgccatc 
aattttatct 
taacagtgca 
gcagagagac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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attgaaaaag 
acctcctcct 
aatttgaaac 
cttgagcaaa 
aaggagcata 
aatattgtga 
cctggtgcag 
caagatgatg 
gaaggatggt 
gaggagttac 
atactacctt 
cttaccttta 



cctcaaaaga 
tctctgatgc 
tagacctttt 
gacctcaacc 
ctcatagcta 
gcaaggcatc 
cccagctcag 
agttgaattt 
ggtttggatc 
cttcaaatgc 
cacactcggt 
aaaaaaaaa 



caaggaaggc 
aaagagccag 
ggaagcgaac 
cagccatcct 
tgtgaaaata 
ttctggtggg 
cagcagactt 
ggaaaagggt 
tttgaatggg 
tggcaacaca 
aatcaacaat 



ctggaacgaa 
aaagacacag 
tcctacaaac 
tgtagtaatt 
tctcggcctt 
cagagcaatc 
tgcaaggcct 
gacattgtga 
aaaaaaggcc 
gctacaaagg 
acagtgtggt 



tgcttaaaac 
cagcgttaat 
tgtcatcaat 
ccatcttcag 
ttttaatgaa 
caggttcttc 
tgtattcttt 
ttatacacga 
attttcctgc 
cataaaacaa 
tcaaataaga 



gtactccagc 
ggatgagaac 
gttagcagaa 
gtggagggaa 
gagattagag 
aactccagcc 
tcaagccagg 
gaaaaaagaa 
cgcttatgtg 
gactctgaac 
ataaagtgct 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1639 



<210> 19 

<211> 1504 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-f eature 

<223> Incyte Clone 2472655 



<400> 19 

cgaaatcgta 

tgcgcgaaga 

ctctgcaagc 

atgagcaagg 

accattgtcc 

aagcaa'aggt 

caactaaata 

tccacagctg 

gtgtttgaga 

tgtcacagtg 

aatttcagcc 

tatgccactg 

tccaatgtgg 

tttatatgga 

actttgacac 

tctctcaatc 

atgacccaga 

gaactggaaa 

ggctcatgcc 

ggagrtcaag 

ataatagctg 

gaggatctct 

ccagcctggg 

aaccagagct 

tggatttcta 

aaaa 



ggacttccga 
tgcgaaaggt 
ggctgctggc 
cagaagctgt 
aggtggatgt 
ttcagagatt 
tcaaagcact 
aaggcctgct 
ccaatgtctt 
acaatccatc 
tcgaggactt 
accttttgag 
cctgtccagg 
cgctgttgat 
catataatgg 
ctctgatcaa 
agatggacct 
agcacattag 
tataattcca 
accagcctga 
ggtgtggtgg 
tgaggctggg 
tgacagcgag 
gacaatgaca 
ctgagttgga 



aagcagcggt 
ggttttgatc 
ggaagatgat 
ctgtgctgcc 
cagcaacctg 
agactgtata 
tttctttggc 
gacccagggt 
tggccatttt 
tcagctcatc 
ccagcacagc 
tgtggctttg 
tacagcattg 
gccggcaata 
aacagaagct 
atatctgagt 
agatgaagac 
ggtcactatt 
gcactttggg 
gaaacatagt 
catgcgcatg 
aggcagaggt 
accctgtctc 
ctctggaaca 
taatatgcat 



ggcgtttgct 
accggggcta 
gagcttcatc 
ctgctggcct 
cagtcggtct 
tatctaaatg 
ctcttttcaa 
gataagatca 
atcctgattc 
tggacatcat 
aaaggcaagg 
aacaggaact 
accaatttga 
ttgctacttc 
ctggtatggc 
gccaccactg 
actgctgaaa 
caaaaaacag 
aggccaaggc 
gagcccttgt 
tagtcccagc 
tgcagtgagc 
aaaatatgta 
ttgcatacct 
ttgtaataaa 



tcactgcttg 
gcagtggcat 
tgtgtttggc 
ctcaccccac 
tccgggcctc 
ctgggatcat 
gaaaagtgat 
ctgctgatgg 
gggaactgga 
ctcgcagtgc 
aaccctacag 
tcaaccagca 
catatggaat 
gcttttttgc 
ttttccacca 
gctttggaag 
aattttatca 
ataatcaggc 
agaaggatca 
ctctacaaaa 
tactcagaag 
tgagattgtg 
tatatttaat 
tctgtacatt 
ctatgaacta 



gaagtgtgag 
tggcctggcc 
gtgcaggaac 
tgctgaggtc 
caaggaactt 
gcctaatcca 
tcatatgttc 
acttcaggag 
gcctctcctc 
aaggaaatct 
ctcttccaaa 
gggtctctat 
tctgcctccg 
aaatgcattc 
aaagcctgaa 
aaattacatt 
aaagttactg 
caggctcagt 
cttgagacca 
agaaataaaa 
gatgaggtgg 
ccactgcact 
atatatataa 
ctggggtaca 
tgaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1504 



<210> 20 

<211> 3096 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 2948818 

<400> 20 

gggtgttctt ataacttaag ttcagttttt tctttcctgt gaggaagagg cagtttttta 60 
aatgaagcca tctctgggga aatcgtattg attgttgtag ctaaatacgg aatttttaaa 120 

21/26 



BNSDOCID: <WO 9958558A2J_> 



wo 99/58558 PCT/US99/1 0567 



gtctttagta 
tcataaatac 
tctgagcaat 
aaaaaaattg 
ggccatcatg 
tgatcctgag 
ttattttgct 
tgggaagcag 
attgaagcat 
gttggtgatg 
acttcaggaa 
acattctcat 
aggtcaggta 
cgtgggcaca 
tgatgggaaa 
gccgcccctt 
cccaacgtta 
gcagaaaata 
acgagaccgg 
tgagctagat 
tggacccttg 
cagggaaatg 
cagccagagc 
tgtcatgatg 
agatcatgta 
tacccagtca 
caaatcagca 
ggaacagatg 
ggagaacaag 
gacgcatgcc 
catagaaaag 
ggcccagcag 
ttgtaaggaa 
gcaagagcgg 
ccaccttctc 
aataatgatc 
gaggacccag 
gctagagtac 
gcaccagacg 
cagaaagcat 
aattaaaaaa 
gaatcaccag 
agatgagcag 
aatgatggcc 
gaggctacag 
gcaaacagag 
cagagcacac 
cgagagaata 
gagcctcaga 
atgagattaa 



tgttgaactg 
aaaggactga 
atggaatatt 
tcaatgagaa 
cgtaaagggg 
gaacttttta 
acaaatgctc 
acccatgaga 
cctaatacta 
gaatattgct 
gtggagatcg 
gcattgattc 
aaactagctg 
ccttactgga 
gttgatattt 
ttcaacatga 
cagtctaatg 
cctcaggaaa 
ccactacgtg 
aacctacagt 
aatgagtcac 
gacagcctgg 
agcagtgtga 
cacgatgacg 
ttcataaggg 
gttcagagcc 
tctttggtta 
tcaggttata 
ctgaaggctg 
aacaactcgt 
gaggcaaagg 
aagaaagatt 
aaaataaaag 
atctccaaac 
actcaacaga 
aagcggcacg 
aaggagatgg 
aggcagctgc 
gaactggaaa 
gtcatggaac 
cagtttcagg 
ttggaagtta 
acaagaaaac 
tctcaagcgt 
ctccagcagg 
gcacaacatg 
cttgagcaga 
aagaacctat 
atgggatttg 
attttttgcc 



gaaatatagg 
agttataaaa 
ttgtgccaca 
ataaagctaa 
tgctgaagga 
ttggtttgca 
acaccagtga 
aatggcaaga 
ttgagtacaa 
taggctcagc 
ctgccattac 
atagggatat 
attttggatc 
tggctccaga 
ggtcacttgg 
atgcaatgag 
aatggacaga 
ggccaacatc 
tcctcattga 
accgaaaaat 
aggaggatga 
gcagcaacca 
acagcatgca 
aaagcacaat 
atgaggcggg 
aggccctcca 
cacgacagat 
agcggatgcg 
agatggacga 
ccatcgagct 
tagctgcagc 
tgacaacttt 
aggaaatgaa 
ataaagagaa 
gactgtacta 
aggtggagca 
agcatgccat 
acacgttaca 
accagctgga 
ttcggcaaca 
acacttgcaa 
ctccaaagaa 
ttgccatttt 
tacggctaga 
aaatggagct 
aacgtgagct 
.agattgaaga 
tggaaaggca 
ggaatttggt 
atttacaaaa 



acatgcgttt 
gagaaaagag 
cacaaaaagg 
ctgatatcaa 
cccagagatt 
tgaaattggg 
ggtggtggca 
tattcttaag 
aggctgttac 
ctctgattta 
tcatggagcc 
taaagcagga 
tgcttcaatg 
ggtgatctta 
catcacttgt 
tgccttatat 
ctcctttagg 
agcagaacta 
cctcatacag 
gaaaaaaata 
ggaagacagt 
ttccattcca 
ggaagtcatg 
caattccagc 
ccacggcgat 
ctaccggaac 
ccatgagcat 
gcgccagcac 
gcaccgcctc 
ggagaagctg 
agatgagaag 
cttagaaagt 
tgaggaccat 
cttgcagcac 
cgacaaaaat 
gcagaacatt 
gctaatccgg 
gaagctacgc 
gtacaataag 
gccaaaaaac 
agtacagacc 
tgagcacaaa 
ggcagagcag 
tgaggctcaa 
gctcaacgcc 
ccagaagcta 
ggagctggct 
agagcgagag 
tacattagat 
aaaaaa 



gagatctact 
aagtttgctg 
tactgaggat 
aaagcagagc 
gccgatctat 
catggaagtt 
attaagaaga 
gaagttaaat 
ttgaaagaac 
ttagaagttc 
ttgcatggac 
aatattcttc 
gcttctcctg 
gctatggatg 
attgaattgg 
cacattgccc 
agatttgttg 
ttaaggcatg 
aggacaaaag 
cttttccaag 
gaacatggaa 
agcatgtccg 
gacgagagca 
tcctccgtcg 
cccaggcctg 
agagagcgct 
gagcaggaga 
cagaagcagc 
aagctacaga 
gccaagaagc 
aagttccagc 
cagaagaagc 
agcacaccca 
acacaggctg 
tgtcgtttct 
cgggaggaac 
cacgacgagt 
atggatctga 
aggcgagaaa 
ttaaaggcca 
aaacagtata 
acaatcttaa 
tatigaacaga 
gaagcagaat 
taccagagca 
gagcagagag 
gcccttcaga 
attgaaactt 
tttcctaagg 



gtgagttgca 
ctaaaatgaa 
ttaccGCcca 
ctgctctact 
cctacaaaga 
ttggagcagt 
tgtcctatag 
ttttacgaca 
acactgcttg 
ataaaaaacc 
tagcctacct 
taacagagcc 
ccaactcctt 
aaggacagta 
cggaacggaa 
agaatgactc 
attactgctt 
actttgttcg 
atgcagttcg 
agacacggaa 
ccagcctgaa 
tgagcacagg 
gttccgaact 
tgcataagaa 
agccgcggcc 
ttgccacgat 
acgagttgcg 
tgatcgccct 
aggaggtgga 
aagtggctat 
aacagatctt 
agtataagat 
agaaagagaa 
aagaggaagc 
tcaagcggaa 
taaataaaaa 
ccacccgaga 
tccgtttaca 
gagaactgca 
tggaaatgca 
aagcactcaa 
agacactgaa 
gtataaatga 
gccaggcctt 
aaatcaagat 
tgtctctgcg 
aggaacgcag 
ttgacatgga 
aggactacag 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3096 



<210> 21 

<211> 1527 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 054191 

<400> 21 

ctccgcttga ggagaagcgc caagtgcgca tggggacgct atagcaattc gtttgctgtc 60 
cttcctctcc ttcgaagatg acaaggccta ccatcgtttc ttcctgcctt tgggccgtca 120 
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'I 



wo 99/58558 



PCT/US99/10567 



ggcagttggt 
tgtcatggct 
taatatacag 
cctgggactt 
catcatctgc 
caccgtcaat 
acagagcaca 
tttctcaagc 
tgaatgtttc 
agccatggct 
tggtatcgcc 
tatccgagcc 
acagtgggca 
ttgctgctgg 
caagcttcac 
caactgtgga 
gcgtgatgca 
cttaaattca 
ctacaatact 
ttgggaaagg 
atatagtgcc 
cagattcact 
gagatagact 
taaattctag 



tgggacccgc 
actctgagtg 
aaatctccag 
tgctggctca 
tataacctac 
tacatctggt 
tctccactgg 
ctgatacctc 
caaaacttca 
gaactcccac 
attttcctgg 
cagacattgt 
gtgattcaca 
ggcccagctg 
atggcccttt 
gtatatggct 
gatacccaga 
ctggaatcca 
ggaacatcca 
acatcttaaa 
attgttagta 
agtgatcctt 
cgtgttacct 
attiattiaaaa 



tccaaccctc 
ttataggttc 
agaraagacc 
cggagacact 
aagcagttgg 
atttgtacac 
tgatagatta 
tgctattgat 
gtcagagcca 
cttctgccaa 
gcagctttgt 
ataagaagtt 
ttgtggacca 
tcattctaat 
atgttctcca 
ggacgcagca 
caccattatt 
ccctgacttt 
ggaactggag 
tcttttctaa 
gtcattctag 
aaagtctcta 
tgatggarar 
aaaaaaa 



ggttcttcct 
aagttcactt 
acttttttat 
tctctatgga 
acagatattc 
agagctgagg 
tacttgtcga 
gacacctgta 
caagtgtatc 
cacatctgtc 
actcagcctc 
tgtgaagtca 
acgggtgcgc 
gatcataaag 
ggctctaacg 
caaattccac 
atgctcacag 
tcctgccagt 
ttattctacg 
ctatgcccta 
atgaattggg 
ttcagggaga 
tggatttgtc 



gcaatacagt 
attgcctatg 
ctgagcttct 
gcttcagtag 
tacatttcct 
atgaaacaca 
grtggtcaaa 
ttctgtctgg 
ttgatgcact 
tgtagcacac 
cttaccatta 
accggctttc 
ttctacccag 
ctgactaagc 
gcaacatctc 
caactaaagc 
aagagattct 
acttctacca 
ctaatggatt 
aactgcagaa 
agtatctctc 
ggaagacact 
taagtctctt 



ggatacaatt 
ctgtattcca 
gtgacctgct 
caaataagga 
catttctcta 
cccagagtgg 
tggcctttgt 
gaaatactag 
caccaccatc 
tttattttta 
tggtcttact 
tggggagtga 
tggccttctt 
cacaggacac 
agggtctact 
aggaggctcg 
atagcagggg 
ttttttgaaa 
ggaaagaatg 
ctcaaaggaa 
cagttattcc 
ttccatctca 
ctagaaaaaa 



180 

240 

300 

360 

420 

4 80 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1527 



<210> 22 

<211> 2948 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc-f eature 

<223> Incyre Clone 1403604 



<400> 22 

aaagaaaggt 

cgagatgcgc 

atgggtggtt 

cacaaccgtg 

agcgtcgggc 

tggccgcgcg 

gccgcgcacc 

tgcgcgcttt 

ctctcggcgc 

gggctgcagt 

gagagacgcc 

cgctgtgctt 

gagtggagga 

ccgaggcggt 

cccagcgttt 

ttggccagtt 

gttttcctga 

gtgccgtgga 

tggaaaatat 

tcattctggc 

aggcatttcc 

gtagcgaggg 

cagacgttat 

gccagtttca 

gttgtgggac 

agagtcgggt 

tccagcttca 

caaaggctgg 



cagccgcaag 
tcttcccgga 
tttttccccc 
acagactgcc 
aggcgcggta 
cgggracgrg 
gcrcat tcac 
tttggagcag 
tggcttcgac 
ccgggaggtg 
agagctgrgc 
tgagagcgca 
ggccctgggc 
gctgacctac 
tcctaatgcc 
catgctgcaa 
cgtggaggcg 
cataaatgaa 
tgaacccttt 
agcttctagg 
tcgcgtaaat 
ccaggtccca 
tctcagtgca 
cttgctctca 
tggagttcag 
tctggttctg 
tttttttaag 
ccgaaaggat 



cgaacttagc 
agtgacgcac 
taccgaggtc 
attcctgagt 
cagaacacca 
caggacccct 
cgaggctact 
attggcgcgc 
tcgctctatr 
gattttccgg 
gcgttaaccg 
gactactgca 
gccgcggggc 
ctcgagccgg 
cttttcgtgg 
cattttcggc 
cagcggcgcc 
ttctatcact 
gacgaatttg 
ggagacaccc 
cctgcttcgc 
aacctgaaga 
ggaggatttg 
agagattgtg 
tgggatggac 
ggagggagac 
agtgaggata 
gattccactt 



actggctaca 
aagtgccggc 
cgtgaggtgt 
ctcttctggc 
acgacagcag 
ttgccgcgtt 
acgtccgcgc 
cccaggccgc 
ttcgcttaaa 
acgtggcgcg 
ggcctttcga 
tcctgggcct 
tcgacgcagc 
agagtgccgc 
tctatgagca 
agctaaactc 
gcttccttca 
gctttcttcc 
aggagtggca 
tctcccacac 
cttcaggggt 
gatatggcca 
gagagcagga 
actctgaatg 
gcctttatca 
tgtccccagt 
ataacactga 
tgtgttgttg 



ccctcctcaa 
ggaaggggaa 
gtgctaacca 
catgggcccc 
cgccctcagc 
gcnggttccg 
acgcgccgtg 
gcttcgcgcg 
aaccgcgggc 
gcgcaaagca 
gaggggggag 
ggacttgcgg 
ctcacccact 
ggccctcatc 
gatgaggcct 
ccccctgcat 
agctggctgg 
cgcagaagaa 
Ectgaagtgc 
cctagtgttt 
attccctgcc 
cgcctctgtc 
ggggcggcac 
gaaaggcagc 
caccatgaca 
aagtccagcc 
ggacctgaaa 
gcggcattca 



ttctggttgg 
gtccaggagc 
aggggcggct 
cggagccgtg 
aagcgttccc 
ggcgcggcgc 
aggcactgcg 
cagaccutgr 
cgcctggccc 
gaaaggattg 
cccgcgtccg 
cagctccagc 
ctgctcctgg 
gcctgggcag 
caagacgcct 
ggcctggagc 
accgcctgcg 
cgccggcggg 
gcccattatt 
ccatcctcag 
agcgtagtca 
ttcttgagcc 
tgccgagtga 
caaataggca 
agactctcag 
ttgggggttc 
gtgacaataa 
acaacagaag 



60 
120^ 
180-'- 
240^ 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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wo 99/58558 PCT/US99/ 10567 



tgtcctgtca gaatcaggaa 
tactaagtga ctggcatttc 
agggagaagt acctgaagcc 
ttattgctgg aggtctcggg 
caatctcttg tggattcctc 
actcccacac agctcatgtg 
attcctcctc atttcctgga 
atcagattga cacaacatat 
ttcttcctga agagcaacag 
gtacctactt caacccccat 
aaggactgga ctaatattca 
atgaccctct agctatagat 
actattcagt gcaaaaagtg 
actacagaaa acagctgaag 
taattttcta atctaagttc 
ttcccaatcc caggccttgt 
gtggttgaat atatttgaga 
tatagttgag ggagcgtagt 
tcacagtttg attaaaacat 
aacaggaaac acttgtagga 
ttccatattt atttagattt 
aaaaaaaa 



tatttgtttg tgtatggggg 
ctccatgtag ggacaacggc 
cggcattctc acagtgcctg 
gcttctgagg agccattgaa 
tgggagtcag tagacatcca 
ctcaatggaa agctgttact 
gtgactgtga tcaatttgac 
gtgccatggc cattaatgtt 
ctcctgctcc ttggaggtgg 
acagtcacat tagacctttc 
ggacccacta aagtagacaa 
actgccactc crcctttccc 
aaaaaggttg gtaaaatagg 
acagtggcca tgcagtccga 
gtttattgag tracagtggt 
gatttgagat ggtaccttag 
tggtacctta gaaaaagttt 
attgacaatt cttcatgtag 
tgtcctgttt gttgcaacag 
cttcctttac caacatactt 
ataagtgtca ataaagcaaa 



tcgaagcgtg gtggaacctg 1740 
ttgggtcagg atcccagtgg 1800 
cacttggcaa gggggagccc 1860 
ctctgtgctc tttctgagac 1920 
gcctcccatt accccaaggt 1980 
ggttggaggg atctggattc 2040 
tacaggattg agctctgagt 2100 
acacaaccat actagtatcc 2160 
tgggaactgc ttttcctttg 2220 
ttccttaagt gctgggcagt 2280 
taaagttttc cacaaatagg 2340 
catccttttt ttcccttagc 2400 
taaaatacct agaaacaatc 2460 
gaggagtagt ggtctgcctc 2520 
ctttagtaaa gtaaaacaat 2580 
aaaaagttac acgcagttcc 2640 
cacgcagatc cttggttgaa 2700 
gaaacctgaa atgaacacag 2760 
aaaactcgga tagttttaac 2820 
tttaaatgtt ttgctattgg 2880 
cttttgatgc ctcaaaaaaa 2940 

2948 



<210> 23 

<211> 1808 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1652936 



<400> 23 

gagagtatta 

atacgatttg 

acagaaaaaa 

acgatgtcac 

agccatggct 

gctaggcagc 

cttcggca tc 

agcagttacc 

cagrtccgat 

ttcttcacca 

gctgactgtg 

gctcggaaga 

ctgcaggcag 

tcactgccgg 

atggctgacc 

agtacratca 

tccaacagtc 

gccatgaaaa 

agagcctcct 

ggattgttac 

gtgtcggagg 

agtccagcaa 

ttcagaccag 

gcttctgcgg 

aaggccattc 

aacagtgagc 

caattggaac 

gggtgacctt 

gatgtttgac 

tgtatagact 



aatgtgaatt 
ggaacccagc 
attccttcaa 
ccctcgcctc 
acagcaaact 
cccttgtcca 
aggcttccct 
ccgacgaacc 
cccttcagcc 
aatcaaagct 
atggagccct 
acggctaccc 
cttttcctaa 
caaatcaaca 
ttcctgtccc 
atgaagcctt 
tcaaagcaag 
ggggcgagga 
ccttggatct 
ccccaccacc 
ccgagttact 
agaaggatgt 
aaaaccaagc 
caaccatgaa 
aaactgctat 
tccagcagca 
aacttcgtcc 
gtctgccaag 
tgtgtcaaaa 
taaaaaaaaa 



gcctctgcct 
tgagctgcat 
aagaatggac 
ccctccttct 
gcggagcagc 
gcccgaggga 
tatccggtcc 
ctggaggata 
agacccaagc 
ttccattcca 
gaccctgcct 
attgcctgag 
gcccaaatgg 
acctcgtgac 
taaccaggat 
accaaaggac 
accaagatcc 
ccctcccacc 
gaataaagtc 
tgcgctccct 
cccacagctg 
actgtattct 
tacagaaaac 
accgcatcca 
ccgcaaaaat 
gctcaaggag 
ggtcactgtg 
atctttcttt 
gctgtgagca 
aaaatagatc 



cgctttatga 
ggaactaagg 
gatgaggata 
tccccgcctc 
gcagaacaga 
tcctcatcag 
ttttcagtgg 
acagaagaac 
tctttcattt 
gaactctcct 
gagttctgtg 
ggcctccctc 
gactgtcaat 
ttgaatcgga 
gtaactagtg 
gtgtctgagg 
agatcttact 
ccgccacctc 
ttccagccca 
ccaagacctt 
agcagagccc 
cagccaccat 
caagagcctt 
acagtccaaa 
aaagaggcaa 
gttcatcaag 
ttgtgacccc 
tgaatgtttt 
gcaaaatata 
tttaattaag 



tgtcaaagaa 
ttcagattcc 
aacaggaaac 
attaccagag 
tgcatccagc 
ggggcccagg 
agagggaact 
agcgcgagta 
caggrtctgt 
atatatggga 
ctgcgtttca 
caactctgca 
tatttgattc 
tggagacatc 
atgacaaaca 
atccagcaac 
ctagcacctc 
ggccacagaa 
gtgtgccagc 
gtccatcaca 
catcccaggc 
caaagcccat 
ccactgctgc 
agcagtcttc 
acgcagtgct 
aacgaattgc 
cccatggttc 
gaacccaact 
atccatatga 
cggtcgcaag 



tgatggtgag 
atatttaact 
acagtctccc 
ggtgcccttg 
accttatgaa 
aaccaagccc 
acaggataac 
ctatgtcaai: 
ggccaagaac 
gcttagtgat 
tctcattgtg 
gccagaatac 
ttattctgag 
tgttaaagac 
agctttgaaa 
tcccaaggat 
catagaagag 
aacccattcc 
taccaagtca 
gtctgaacaa 
tgcagaaagt 
tcgtaggaaa 
aagtgggcca 
caaacagaag 
ggctcggctg 
attggaaaac 
aagtgacagr 
acttgtcata 
ccttttctct 
cttattccct 
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ttagtgag 



1808 



<210> 24 
<211> 1148 
.<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 1710702 



<400> 24 

tgcgtacgtg 

tcaggcgagc 

ggctcggccg 

acgtattttg 

gtgcccttgg 

atcactcagg 

cgaggaatac 

tctggagatg 

ttcatctgtg 

cccaacattg 

accgagcctg 

ttgctgtgtg 

ggagttccgt 

ggacaagtgc 

gcccgggcca 

gaggacacct 

gtggaacact 

gattcctcag 

agcagtgtgt 

cacccaaa 



cgtcgtctct 
cgatctgggg 
ccgccctgtg 
gcgtggttca 
cccagcgcca 
tccactggtg 
agatgtacga 
cctccccagt 
tgggaacgtg 
tactgagcga 
cccagggaca 
tctggcggtc 
gcccctctgt 
atctatatga 
tctgcgccct 
ttgtgcatat 
gtcatggtga 
gcaactcctt 
gagaagagca 



atggtggcgg 
ctgcaggatg 
caacaacctc 
tggaccaagc 
gctccacgct 
tgtcctcccc 
gtccaatggc 
acaggctgtg 
gtcaggccgg 
ggagctggct 
ggattgtgtg 
agggccagaa 
gcagctgtgg 
ggccactaca 
ggacctggct 
ctggaagctg 
gtgtgtcgcc 
tgctgtgact 
gccttccttt 



cggatttgga 
ttccgctggg 
agtgtgctgc 
gcccagcttc 
aaggagggtg 
ttccgagtgc 
tacaccatgg 
tttgcccggg 
gtgctggtgt 
gggcaccaga 
gctgacatgg 
ttcacattat 
caggggatca 
ggaaatctac 
tctgaggtgg 
agcagaaacc 
gacacccagc 
ggctatgacc 
gtccctgtgg 



gggaccctac 
agcgctccat 
agctgccggc 
tcagcgctgc 
ctggagtgag 
tgctggtact 
tctactggca 
gaattgctgc 
ttgacatccc 
tgccaaccac 
cgacggcaga 
tgacccgca t 
tagcagcagg 
atgtccagat 
gcaagctact 
cagagagtgg 
tgtgtggtgc 
ttgcggagat 
tattcataaa 



gaaccaggag 
tcccctgcga 
tcgcaacctc 
tcctgagggt 
tcccccactt 
cacctcacat 
tgcactggac 
cagtggccac 
agcaaagggt 
agacattgcc 
tgactcaggc 
tccaggattt 
ctatgggaac 
caatgccca t 
ctctgcaggt 
ctacattgag 
tcgattttgt 
ccggagattc 
gtacccgctc 
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<210> 25 

<211> 2419 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc-feature 

<223> Incyte Clone 3239149 
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cggacgcgtg 

tcggcgatta 

tcctcagaga 

aattcactta 

tgtttagcaa 

ggcacttcca 

aaggaactgt 

gaacatctta 

cctatgttgc 

gagaacattc 

gaatggtacc 

aggacagatt 

aaaaacaaac 

aaaattatac 

cagagaattc 

cagaaaatag 

gaatgcaagc 

agagtgatca 

gaaatgctaa 



ggggcctggc 
tggacccggc 
gagaagactg 
gacagacata 
gcactgctat 
gtatgattgt 
gtgtcaaata 
tatcccaaat 
agagagattt 
tgtcatacct 
gagtgacctc 
ctctgtggag 
ctcctgacgg 
aagacattga 
actgccgaag 
taagcggcct 
gaattctcac 
taacaggatc 
acacgttgat 



accaaagggg 
cgaggcggtg 
taataatggc 
caacagctgt 
gaagactgag 
gcccaagcaa 
ctttgagcag 
gtgtcattac 
cataactgct 
ggatgccaaa 
tgatggcatg 
aggcctggca 
gaatgctcct 
gacaatagaa 
tgaaacaagc 
tcgagacaac 
aggccataca 
atcggattcc 
tcaccattgt 



cggccccggc 
ctgcaagaga 
gaacccccta 
gccagactct 
aattgtgtgg 
cggaaactct 
tggtcagagt 
caacatgggc 
ctgccagctc 
tcactatgtg 
ctgtggaaga 
gaacgaagag 
cccaactctt 
tctaattgga 
aaaggagttt 
acaatcaaga 
ggttcagtcc 
acggtcagag 
gaagcagttc 



ggagagcgga 
aggcactcaa 
ggaagataat 
gcttaaacca 
ccaaaacaaa 
cagcaagcta 
cagatcaagt 
acataaactc 
ggggattgga 
ctgctgaact 
agcttatcga 
gatggggaca 
tttatagagc 
gatgtggaag 
actgtttaca 
tctgggataa 
tctgtctcca 
tgtgggatgt 
tgcacttgcg 



cccagtggcc 
gtttatgaat 
accagagaag 
agaaacagta 
acttgccaat 
tgaaaaggaa 
ggaatttgtg 
gtatcttaaa 
tcatattgct 
tgtgtgcaag 
gagaatggtc 
gtatttattc 
actttatcct 
acatagttta 
gtatgatgat 
aaacacattg 
gtatgatgag 
aaatacaggt 
tttcaataat 
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ggcatgatgg 
actgacatta 
tttgatgaca 
agtacttgtg 
tacagggaca 
gaatgtggtg 
tttgataaca 
gtggctgctt 
cattccggaa 
gatgacacaa 
cgttcccctt 
acttgcccag 
aacaacagta 
tgagactcct 
actgactgct 
ttttaaacct 
cagacaaagg 
ggccattaag 
ccactctagg 
tccccggtta 
ataggtctag 
aagggaaaca 



tgacctgctc 
ccctccggag 
agtacattgt 
aatttgtaag 
ggctggtagt 
catgtttacg 
agaggatagt 
tggacccccg 
gagtttttcg 
tcctcatctg 
ctcgaacata 
gacccattaa 
acaatcaaac 
gttgggacac 
tcagtgctgc 
cccctcctct 
tgacttataa 
gcaggaggaa 
gggtaggggg 
actgcagggg 
gggaggggga 
caggatttg 



caaagatcgt 
ggtgctggtc 
ttctgcatct 
gaccttaaat 
gagtggctca 
agtgttagaa 
cagtggggcc 
tgctcctgca 
actacagttt 
ggacttccta 
cacctacatc 
agttgcggta 
tactgcccag 
agttggtctg 
tatcagaaga 
cctcctttca 
atatatttag 
ctaggtttcc 
gggggtgnca 
atgtccagga 
cagagttccc 



tccattgctg 
ggacaccgag 
ggggatagaa 
ggacacaaac 
tctgacaaca 
ggccatgagg 
tatrgatggaa 
gggacactct 
gatgaattcc 
aatga tccag 
tccagataaa 
tttaacgtat 
tttccctgga 
cagtcggccc 
tgtcttctat 
cctctgcacc 
gtgttttngc 
cctgtatagg 
gctttcaagg 
ccgggggggc 
ctcgtaatag 



tatgggatat 
ctgctgtcaa 
ctataaaggt 
gaggcattgc 
ctatcagatt 
aattggtgcg 
aaattaaagt 
gtctacggac 
agattgtcag 
ctgcccaagc 
taaccataca 
ctgccaatac 
ctagccgagg 
aggacggtct 
cttttgtgaa 
tagttttttc 
ccaggaatct 
gcctgcgggg 
cccaggggcc 
tacgagcaag 
ggctcggggg 



ggcctcccca 
tgttgtagac 
atggaacaca 
ctgtttgcag 
atgggacata 
ttgtattcga 
gtgggatctt 
ccttgtggag 
tagttcacat 
tgaaccccce 
ctgacctcat 
caggatgagc 
agcagggctt 
actcagcaca 
tgattggaac 
ccattggttc 
ctcttgcttt 
ggagaggacc 
ccaggtgtct 
gcccggcccc 
agggcaggga 
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<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc-f eature 

<223> Incyte Clone 3315936 



<400> 26 

atttaatatg 

ttgcttagac 

aacaagcaaa 

ctagcttcaa 

atcaacactg 

tcctactggg 

gatcacagcc 

gtacttcctc 

gatatccatg 

ggaacccacc 

ttgattgcac 

cgactgagat 

aaaaattaag 



actcactata 
tgcggcccac 
tcctagtgct 
acagagtcca 
aagacagcca 
aaatgtacct 
gattacctga 
tggttgtgtt 
aagctttggc 
tgactaagaa 
agctggctgc 
gtcatcagtg 
ccatttccta 



gggaatttgg 
gtggaaggct 
gggcctggat 
gcacagtgtg 
gatggagttc 
atccaaggga 
agccaagaaa 
tgcaaacaaa 
attatctgaa 
tggctcagag 
agatgtgcag 
ttgaatggca 
ttaaaa 



ccctcgagct 
cttagccacc 
ggagcaggaa 
gcacccaccc 
ctggagattg 
ttgctgctga 
taccttcatc 
caggarcrtg 
gtgggaaatg 
ataccctcca 
tgaccaggac 
ggcttgaagc 



agagattcgg 
ctgcctggcc 
aaaccagtgt 
aaggtttcca 
gtggcagtaa 
tctttgtggt 
agctaattgc 
aagcagccta 
acaggaagat 
ccatgcaaga 
tcagcccact 
caaaggtttc 



gcacgagggg 
cgaggagaaa 
cctgcactct 
tgcagtttgc 
accttttcgg 
ggattcagca 
agcaaaccca 
tcacattaca 
gttcttgttt 
tgccaaagac 
gtgcggctca 
cacctcaaat 
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